deepseek-ai / DeepSeek-Math

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
MIT License
783 stars 46 forks

GRPO as part of HF TRL? #26

Open idobenshaul10 opened 2 months ago

idobenshaul10 commented 2 months ago

It would be cool to see GRPO compared with other methods.