Open satheeshkatipomu opened 6 months ago
No plans...
Why? We need this.
We are also very interested to do this. So far our experiments have been unsuccessful. It would be incredible if we could get some clues.
What is the problem? I have used repos like MFTCoder and Firefly to train it successfully, except for evaluation during training, and the loss decreases normally. Would these training pipelines be OK?
I have recorded the bugs I ran into while adapting DeepSeek-V2 to training repos. Can we talk over WeChat about these cases? My ID: yiyepiaoling0715
like this
Hi,
Can you please give us instructions for fine-tuning the DeepSeek-V2 model? Can we use the finetune.py script from DeepSeek-MoE?
https://github.com/deepseek-ai/DeepSeek-MoE/blob/main/finetune/finetune.py