huggingface/nanotron
Minimalistic large language model 3D-parallelism training
Apache License 2.0
add inference for mamba
#136
Closed
3outeille
closed
7 months ago
3outeille
commented
7 months ago
https://github.com/huggingface/nanotron/pull/103