allenai / OLMo

Modeling, training, eval, and inference code for OLMo
https://allenai.org/olmo
Apache License 2.0
4.37k stars 431 forks source link

sharded ckpt is saved only for fsdp #665

Closed ananyahjha93 closed 1 month ago

ananyahjha93 commented 1 month ago

Fixes #664 .