clue-ai / ChatYuan

ChatYuan: Large Language Model for Dialogue in Chinese and English
https://www.clueai.cn
Other
1.9k stars 183 forks source link

分布式训练 #20

Closed flower-with-safe closed 1 year ago

flower-with-safe commented 1 year ago

主要修改: 1.optimizer参照论文修改为Adafactor,节约显存。 2.加入horovod分布式训练。 3.手动实现accumulation steps,扩大batch size。