bigcode-project / Megatron-LM

Ongoing research training transformer models at scale
Other
376 stars 49 forks

Add Deepspeed integration [WIP] #62

mayank31398 closed this 3 months ago