allenai / RL4LMs

A modular RL library to fine-tune language models to human preferences
https://rl4lms.apps.allenai.org/
Apache License 2.0
2.13k stars 191 forks source link

In the paper, what is the detail setting of supervised learning? Is SL has additional supervised data? #49

Open guotong1988 opened 1 year ago

guotong1988 commented 1 year ago

https://openreview.net/forum?id=8aHzds2uUyB

Thank you very much!