EleutherAI / pythia

The hub for EleutherAI's work on interpretability and learning dynamics
Apache License 2.0
2.23k stars 165 forks source link

Can I provide custom data and continue training Pythia on this new data? #113

Closed GeorgiAngelov closed 1 year ago

GeorgiAngelov commented 1 year ago

Can I provide custom data and continue training Pythia on this new data? I have data in a specific domain + language and I would like to use Pythia as the base model and simply continue training it with this additional new data. Is this how this would work or do I need to completely start from scratch ( i.e. traning a Pythia from the ground up with this new data ) ?

StellaAthena commented 1 year ago

Yes you can! You’ll need to either use a pre-final checkpoint or warm up the learning rate again, as the learning rate for the final checkpoint is very small. However you can absolutely finetune Pythia models, and many people have already done so both within EleutherAI and at other orgs.