bigcode-project / Megatron-LM

Ongoing research training transformer models at scale
371 stars, 48 forks

Diff with nvidia main #84

Open jlamypoirier opened 8 months ago