bigcode-project / Megatron-LM

Ongoing research training transformer models at scale
Other
374 stars 49 forks source link

Support flash attn 2 #72

Closed jlamypoirier closed 1 year ago