Timothy023/RLMEC
The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"
8 stars, 4 forks
issues
#4 SFT on math data cannot get the results in the paper
YJiangcm, closed, 5 months ago, 3 comments
#3 Question about the SFT method
YJiangcm, closed, 5 months ago, 5 comments
#2 Reward value error while generating training data for rlmec
liminghao0914, closed, 10 months ago, 2 comments
#1 code and dataset
zhanghaoie, closed, 10 months ago, 5 comments