Open fakerybakery opened 3 months ago
We just released OLMo integration into the transformers library (v4.40.0 and up), with corresponding -hf
checkpoints on Huggingface Hub (e.g. https://huggingface.co/allenai/OLMo-1.7-7B-hf). I haven't tried gradient checkpointing there, but it may work.
I confirmed it does not work. This would a great addition.
Hi, I'm trying to finetune OLMo but running into the error
ValueError: OLMoForCausalLM does not support gradient checkpointing.
Is this planned in the future?Thanks for releasing OLMo!