Liuhong99 / Sophia

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”
MIT License
938 stars 52 forks

Update prepare.py #14

Open yhgon opened 1 year ago

yhgon commented 1 year ago

This just avoids an error in the cache dir.