bigscience-workshop / Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2
Other
1.31k stars 213 forks source link

[bloom inference scripts] improvements #345

Closed stas00 closed 2 years ago

stas00 commented 2 years ago

further tweaking of the scripts - updating the stats