deepseek-ai / DeepSeek-Math

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
MIT License
783 stars 46 forks source link

Any Plan to release the code of GRPO? #29

Open Ye-Yang-SDUWH opened 1 month ago

Ye-Yang-SDUWH commented 1 month ago

The idea of GRPO is impressive. Is there any plan to release the implementation of this method? THX:)

saisurbehera commented 3 weeks ago

+1