Open SupercarryNg opened 2 weeks ago
We use a self-developed fine-tuning framework and code, so we cannot release it. We are currently trying to use the open-source DeepSpeed for fine-tuning. If there is any progress, we will update the README as soon as possible.
We use a self-developed fine-tuning framework and code, so we cannot release it. We are currently trying to use the open-source DeepSpeed for fine-tuning. If there is any progress, we will update the README as soon as possible.
Is there any update on this? Looking forward to your release of the SFT code.
Great Work and Congraduations! Is there any plan to release a fintune example code for DeepSeek-Coder-V2? I noticed that you mentioned about finetuning this model with 8*A100 GPUs with some skills, could you be more specific? THX!