issues
search
bigscience-workshop
/
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Other
1.31k
stars
213
forks
source link
[Bloom inference] further improvements
#344
Closed
stas00
closed
2 years ago
stas00
commented
2 years ago
messed up
messed up