allenai / reward-bench

RewardBench: the first evaluation tool for reward models.
https://huggingface.co/spaces/allenai/reward-bench
Apache License 2.0
440 stars, 52 forks

New week new models #85

Closed natolambert closed 7 months ago

natolambert commented 8 months ago

Mostly closes #84 (I figured the DPO-only models will be similar, so I didn't add them for now). Closes #83.