argonne-lcf / Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Performance measurement and profiling of 7B and 70B models on different systems #42

Open. zhenghh04 opened this issue 3 months ago.