deepseek-ai / DeepSeek-Math

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
MIT License

Something is wrong with flash-attn in my environment; can I drop it when fine-tuning DeepSeek-Math? #28

Open AceCHQ opened 1 month ago

AceCHQ commented 1 month ago

Hello, something is wrong with flash-attn in my environment. Can I drop it when I fine-tune DeepSeek-Math? Will that degrade the model's performance? Thank you.
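For context, here is a minimal sketch of what "dropping flash-attn" could look like, assuming the model is loaded through Hugging Face `transformers` (a version that accepts the `attn_implementation` argument) rather than through a custom training script. The helper names `choose_attn_implementation` and `load_model` are hypothetical, and the model ID `deepseek-ai/deepseek-math-7b-base` is an assumption about which checkpoint is being fine-tuned. FlashAttention computes exact attention, so switching kernels is generally expected to change speed and memory use rather than the model's outputs, apart from small numerical differences.

```python
def choose_attn_implementation() -> str:
    """Return "flash_attention_2" if flash_attn imports cleanly, else "sdpa".

    Hypothetical helper: a broken install can be present on disk yet fail
    at import time, so we actually try the import instead of only checking
    that the package exists.
    """
    try:
        import flash_attn  # noqa: F401
    except Exception:
        # Fall back to PyTorch's built-in scaled-dot-product attention.
        return "sdpa"
    return "flash_attention_2"


def load_model(model_id: str = "deepseek-ai/deepseek-math-7b-base"):
    """Load the model with whichever attention backend is usable.

    The model ID is an assumption; replace it with the checkpoint you use.
    """
    # Imported lazily so the helper above works without these packages.
    import torch
    from transformers import AutoModelForCausalLM

    return AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,
        attn_implementation=choose_attn_implementation(),
    )


if __name__ == "__main__":
    print("attention backend:", choose_attn_implementation())
```

Passing `attn_implementation="eager"` instead selects the plain PyTorch attention path, which is usually the slowest and most memory-hungry option but has no extra dependencies.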