bigcode-project / Megatron-LM

Ongoing research training transformer models at scale

Want explanation of the MQA related code #80

Closed hyunwoongko closed 12 months ago

hyunwoongko commented 12 months ago

I was confused. I now understand all the details of the implementation. Closing the issue.