deepseek-ai / DeepSeek-Coder-V2

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
MIT License
1.19k stars, 62 forks

Any plan to release a fine-tuning example? #5

Open SupercarryNg opened 2 weeks ago

SupercarryNg commented 2 weeks ago

Great work, and congratulations! Is there any plan to release fine-tuning example code for DeepSeek-Coder-V2? I noticed you mentioned that this model can be fine-tuned on 8*A100 GPUs with some techniques. Could you be more specific about them? Thanks!

guoday commented 1 week ago

We use a self-developed fine-tuning framework and code, so we cannot release it. We are currently trying to use the open-source DeepSpeed for fine-tuning. If there is any progress, we will update the README as soon as possible.

fengyang95 commented 22 hours ago

Is there any update on this? Looking forward to your release of the SFT code.

guoday commented 5 hours ago

You can refer to the following links:
https://github.com/datawhalechina/self-llm/blob/master/DeepSeek-Coder-V2/04-DeepSeek-Coder-V2-Lite-Instruct%20Lora%20%E5%BE%AE%E8%B0%83.md
https://github.com/DYF-AI/custom-swift