deepseek-ai / DeepSeek-Math

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
MIT License
821 stars 51 forks source link

Any Plan to release the code of GRPO? #29

Open Viper403 opened 3 months ago

Viper403 commented 3 months ago

The idea of GRPO is impressive. Is there any plan to release the implementation of this method? THX:)

saisurbehera commented 2 months ago

+1