Timothy023 RLMEC issues - Githubissues

Timothy023 / RLMEC

The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"

8 stars, 4 forks

issues

SFT on math data cannot get the results in the paper

#4 YJiangcm closed 5 months ago (3 comments)

Question about the SFT method

#3 YJiangcm closed 5 months ago (5 comments)

Reward value error while generating training data for RLMEC

#2 liminghao0914 closed 10 months ago (2 comments)

code and dataset

#1 zhanghaoie closed 10 months ago (5 comments)