issues
search
argonne-lcf
/
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Other
9
stars
12
forks
source link
Performance measurement and profiling of 7B and 70B models on different systems
#42
Open
zhenghh04
opened
3 months ago