issues
search
microsoft
/
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Other
1.9k
stars
345
forks
source link
Extend test utilities to support more accelerators
#418
Closed
xinyu-intel
closed
4 months ago