modal-labs / llm-finetuning

Guide for fine-tuning Llama/Mistral/CodeLlama models and more
MIT License
538 stars 84 forks

Update axolotl image and other dependencies #28

Closed by mwaskom 9 months ago

mwaskom commented 9 months ago

The primary change here is to update the version of the axolotl container to correspond to the v0.4.0 release. There are also some changes directly downstream of that:

Additionally, I've made some updates to the configs that aren't strictly related to the axolotl version, but arose from the testing that I was doing:

Finally, I updated some of the CI that I added in a previous PR:

Although the CI training runs are fairly lightweight and take just a couple of minutes, the models they produce seem pretty good (evaluation loss of ≈0.06 for Mistral).
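For anyone who wants to turn that observation into an automated check, here is a minimal sketch of a loss gate that a CI step could run over the training log. It is not part of this PR. It assumes the log contains dict-style lines with an `'eval_loss'` key, as the Hugging Face `Trainer` typically prints them. The function names and the 0.1 threshold are hypothetical.

```python
import re

# Assumption: the training log contains dict-style lines such as
# "{'eval_loss': 0.0612, 'epoch': 2.0}". Adjust the pattern if the format differs.
EVAL_LOSS_RE = re.compile(r"'eval_loss':\s*([0-9]*\.?[0-9]+(?:[eE][-+]?[0-9]+)?)")


def last_eval_loss(log_text):
    """Return the final eval_loss reported in the log, or None if there is none."""
    matches = EVAL_LOSS_RE.findall(log_text)
    return float(matches[-1]) if matches else None


def check_eval_loss(log_text, threshold=0.1):
    """Return True when the final eval_loss is at or below the (hypothetical) threshold."""
    loss = last_eval_loss(log_text)
    if loss is None:
        raise ValueError("no eval_loss found in training log")
    return loss <= threshold
```

A CI step could read the captured training output, call `check_eval_loss`, and fail the job when it returns `False`. The threshold would need tuning per model and dataset, since a short CI run on a small sample says little about full-scale training quality.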