Open g-i-o-r-g-i-o opened 2 years ago
Is there any resource that explains how to train from scratch? I'm interested on building geppetto on a smaller dataset. I suppose that italian requires a lot of work with the tokenizers??
Hi @GianniGi check this out: https://huggingface.co/blog/how-to-train
I think it pretty much covers it up.
Is there any resource that explains how to train from scratch? I'm interested on building geppetto on a smaller dataset. I suppose that italian requires a lot of work with the tokenizers??