allenai / reward-bench

RewardBench: the first evaluation tool for reward models.
https://huggingface.co/spaces/allenai/reward-bench
Apache License 2.0
281 stars 28 forks source link

Minor run_rm.py fixes #97

Closed PavelCz closed 3 months ago

PavelCz commented 3 months ago

Looking forward to using this project!

Here are some minor changes I made to be able to run run_rm.py locally:

natolambert commented 3 months ago

Thx @PavelCz -- looks great. Any chance you can add to run_dpo.py too? Trying to keep those scripts in sync.

PavelCz commented 3 months ago

@natolambert, I added the same changes to run_dpo.py. I can run that file locally as well now.

natolambert commented 3 months ago

LGTM. Should be able to merge once the workflows run. I'll probably handle it! Thanks!