allenai / OLMo

Modeling, training, eval, and inference code for OLMo
https://allenai.org/olmo
Apache License 2.0
4.2k stars 392 forks source link

Make hf_olmo support AutoModelForCausalLM #631

Closed 2015aroras closed 1 week ago

2015aroras commented 2 weeks ago

hf_olmo was conflicting with transformers because they were both claiming model_type=olmo. Changing hf_olmo's model type to something else seems to make all our problems go away...

We need to go change all our uploaded models' checkpoints before we merge this though, but thankfully having the wrong model type in config before this gets merged is not a breaking issue (only causes warnings).