bigcode-project / Megatron-LM

Ongoing research training transformer models at scale
Other
376 stars 49 forks source link

re-merge from NVIDIA main #64

Closed RaymondLi0 closed 1 year ago