Open Viper403 opened 3 months ago
Congratulations to Qwen team! Another outstanding job!
I noticed that you use GRPO to RL your math model. For now, there is no released implementation of GRPO. Do you have any plan to release the code?
Thank you very much!
+1
Congratulations to Qwen team! Another outstanding job!
I noticed that you use GRPO to RL your math model. For now, there is no released implementation of GRPO. Do you have any plan to release the code?
Thank you very much!