bigcode-project / starcoder2

Home of StarCoder2!
Apache License 2.0
1.71k stars 158 forks source link

Inquiry about Fine-Tuning Using Custom Code #19

Open tclxmeng-jia opened 5 months ago

tclxmeng-jia commented 5 months ago

Hi there,

I hope this message finds you well. I am currently exploring the process of fine-tuning models using my own codebase, and I was hoping to seek some guidance on this matter.

Could you please provide me with information on how I can effectively fine-tune models using my own codebase? Additionally, would it be possible for you to share any scripts or resources related to data preprocessing for this purpose?

I truly appreciate any assistance or insights you can provide on this matter. Thank you very much for your time and support.

Best regards @loubnabnl

loubnabnl commented 5 months ago

You can check this blog post for fine-tuning code models on personal codebases: https://huggingface.co/blog/personal-copilot For data processing you can find the scripts we used at https://github.com/bigcode-project/bigcode-dataset/ and details in the StarCoder papers.