is multi-dimensional reward supported?

allenai / RL4LMs

A modular RL library to fine-tune language models to human preferences

https://rl4lms.apps.allenai.org/

Apache License 2.0

2.13k stars 191 forks source link

Open zabir-nabil opened 1 year ago

zabir-nabil commented 1 year ago

Hi, thanks for publishing this awesome library. Can I add a configuration / modify the reward.py to return a vector instead of a scalar reward?