Liuhong99/Sophia
The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”
MIT License
How to run the Sophia optimizer with the Hugging Face Trainer
#32
Closed
Dominic789654 closed 11 months ago