issues
search
allenai
/
RL4LMs
A modular RL library to fine-tune language models to human preferences
https://rl4lms.apps.allenai.org/
Apache License 2.0
2.13k
stars
191
forks
source link
In the paper, what is the detail setting of supervised learning? Is SL has additional supervised data?
#49
Open
guotong1988
opened
1 year ago
guotong1988
commented
1 year ago
https://openreview.net/forum?id=8aHzds2uUyB
Thank you very much!
https://openreview.net/forum?id=8aHzds2uUyB
Thank you very much!