Open kadirnar opened 1 month ago
cc @eliebak
Hey, thanks for your message. The tokenizer is here: HuggingFaceTB/cosmo2-tokenizer. I'll add it to the model card, thanks for noticing it! :)
@eliebak Could you check the link again? (404 error) Could you also add some sample training code?
Should work now, sorry (you also have a tokenizer file in the model repo, btw). We will release the training code soon. In the meantime, there are some examples of how to train a model with nanotron here: https://github.com/huggingface/nanotron/tree/main/examples :)
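For anyone landing here, a minimal sketch of loading the tokenizer mentioned above, assuming the `transformers` library is installed and the `HuggingFaceTB/cosmo2-tokenizer` repo is reachable from your machine. The helper names `load_tokenizer` and `token_ids` are made up for illustration and are not part of any official SmolLM code:

```python
# Sketch only: load the tokenizer referenced in this thread and encode a string.
# Assumes `pip install transformers` and network access to the Hugging Face Hub.

TOKENIZER_REPO = "HuggingFaceTB/cosmo2-tokenizer"  # repo id quoted in this thread


def load_tokenizer(repo_id: str = TOKENIZER_REPO):
    """Download and return the tokenizer (hypothetical helper)."""
    # Imported lazily so this file can be imported without transformers installed.
    from transformers import AutoTokenizer

    return AutoTokenizer.from_pretrained(repo_id)


def token_ids(tokenizer, text: str) -> list:
    """Return the token ids the tokenizer produces for `text` (hypothetical helper)."""
    return list(tokenizer(text)["input_ids"])


if __name__ == "__main__":
    tok = load_tokenizer()
    ids = token_ids(tok, "Hello SmolLM!")
    print(len(ids), ids)
    print(tok.decode(ids))
```

This only covers tokenization; for the training loop itself, the nanotron examples linked above are the place to look until the official code is released.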
Hi,
I want to train the SmolLM model. I couldn't find any information about the tokenizer on the blog or model cards. Can you help me with the training code?
Example Code: