allenai / OLMo

Modeling, training, eval, and inference code for OLMo
https://allenai.org/olmo
Apache License 2.0
4.79k stars 488 forks

Annealing configs #738

Closed: dirkgr closed this 4 weeks ago

dirkgr commented 1 month ago

This is mainly a bunch of annealing configs that right now live in an obscure branch. There are also run scripts to make these work in various scenarios.

dirkgr commented 4 weeks ago

It's accumulated over quite a long time. I think we may want to cull these at some point.