loubnabnl / santacoder-finetuning

Fine-tune SantaCoder for Code/Text Generation.
Apache License 2.0
179 stars 22 forks

Fine-tuning generated only three files. Is that correct? Why is there no tokenizer.json file? #17

Closed chen-lee-li closed 11 months ago

chen-lee-li commented 1 year ago

[image]

loubnabnl commented 1 year ago

Yes, that's expected. You can get the tokenizer files from SantaCoder's repo (see the README).
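For anyone who wants the tokenizer files to sit next to the fine-tuned weights, here is a minimal sketch of one way to do it. It assumes the base model is the `bigcode/santacoder` repo on the Hugging Face Hub and that `transformers` is installed. The helper name `add_tokenizer_to_checkpoint` and its `load_tokenizer` parameter are hypothetical and not part of this repository.

```python
from pathlib import Path


def add_tokenizer_to_checkpoint(checkpoint_dir, base_model="bigcode/santacoder",
                                load_tokenizer=None):
    """Save the base model's tokenizer files into a fine-tuned checkpoint folder.

    `load_tokenizer` is a hypothetical hook (any callable taking a model name and
    returning an object with `save_pretrained`); it defaults to
    `AutoTokenizer.from_pretrained`.
    """
    if load_tokenizer is None:
        # Imported lazily so the default path is only needed when actually used.
        from transformers import AutoTokenizer
        load_tokenizer = AutoTokenizer.from_pretrained

    # Fine-tuning does not change the tokenizer, so the base one is reused as-is.
    tokenizer = load_tokenizer(base_model)

    out = Path(checkpoint_dir)
    out.mkdir(parents=True, exist_ok=True)
    tokenizer.save_pretrained(str(out))

    # Return the file names now present so the caller can check the result.
    return sorted(p.name for p in out.iterdir())
```

Calling `add_tokenizer_to_checkpoint("path/to/your/checkpoint")` should then write the tokenizer files alongside the model files, after which the folder can be loaded with `AutoTokenizer.from_pretrained` directly. Alternatively, the tokenizer can simply be loaded from the base repo at inference time without copying anything.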