issues
search
HuangLK
/
transpeeder
train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism
Apache License 2.0
208
stars
18
forks
source link
remove unused arg
#12
Closed
HuangLK
closed
1 year ago