allenai / RL4LMs

A modular RL library to fine-tune language models to human preferences
https://rl4lms.apps.allenai.org/
Apache License 2.0
2.13k stars 191 forks source link

is multi-dimensional reward supported? #60

Open zabir-nabil opened 1 year ago

zabir-nabil commented 1 year ago

Hi, thanks for publishing this awesome library. Can I add a configuration / modify the reward.py to return a vector instead of a scalar reward?