issues
search
microsoft
/
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Other
1.89k
stars
344
forks
source link
Update yml to be valid
#427
Closed
loadams
closed
3 months ago