Lightning-AI / litgpt

20+ high-performance LLM implementations with recipes to pretrain, finetune and deploy at scale.
https://lightning.ai
Apache License 2.0
8.11k stars 819 forks source link

support for older models #1153

Open qwenzo opened 3 months ago

qwenzo commented 3 months ago

Hello,

I was wondering if it is straightforward to bring older models such as GPT-2 to lit-gpt. If so, what files/configs do I need to change?

Thank you!

rasbt commented 3 months ago

Good point, and it should be. I use GPT-2 myself privately a lot as well, and it'd be nice to have it in LitGPT as well.

I think the architecture is similar to GPTNeo, so you can probably copy and adapt the GPTNeo config. The general todo list I use for adding new configs is: