deepseek-ai/DeepSeek-Math
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
MIT License
GRPO as part of HF TRL? #26 (Open)
idobenshaul10 opened 2 months ago
idobenshaul10 commented 2 months ago
Would be cool to see this compared to other methods