Catastrophic forgetting problem

deepseek-ai / DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

https://coder.deepseek.com/

MIT License

6.6k stars 461 forks source link

Catastrophic forgetting problem #134

Open shatealaboxiaowang opened 6 months ago

shatealaboxiaowang commented 6 months ago

Hi dear:

Thanks for your open source, but when i finetuned (whatever full parameters or LoRa ) on my dataset, catastrophic forgetting kept coming up (decrease in performance on the humaneval), i do not know how to solve it, do you have any tops?

guoday commented 6 months ago

If you fine-tune on the Instruct model, I think such a phenomenon is normal. There are two reasons for this: one is that your data is inferior compared to the original data used to fine-tune the Instruct model, and the other reason is that further fine-tuning on the Instruct model could lead to overfitting.

shatealaboxiaowang commented 6 months ago

Thx, i will continue to check my data quality.