huggingface / smollm

Everything about the SmolLM & SmolLM2 family of models
https://huggingface.co/HuggingFaceTB
Apache License 2.0
1.09k stars 42 forks source link

Code for finetuning the instruct models #2

Open bil-ash opened 2 weeks ago

bil-ash commented 2 weeks ago

@loubnabnl The code for finetuning the base models are provided in this repo. However, I guess that the same code cannot be used for finetuning the instruct models. So, please provide code for finetuning the instruct models.

loubnabnl commented 4 days ago

Hi, the current script should work if you change the dataset content column with properly formatted with ChatML conversations. But we added the recipe for building SmolLM2 instruct with the alignment handbook here https://github.com/huggingface/alignment-handbook/blob/main/recipes/smollm2/README.md