bigcode-project / Megatron-LM

Ongoing research training transformer models at scale
Other
371 stars 48 forks source link

re-merge from NVIDIA main #68

Open RaymondLi0 opened 1 year ago

RaymondLi0 commented 1 year ago

Among other things, fixes a backward compatibility issue of the checkpoint merging tools introduced by the previous merge.