allenai / reward-bench

RewardBench: the first evaluation tool for reward models.
https://huggingface.co/spaces/allenai/reward-bench
Apache License 2.0
440 stars 52 forks source link

Fix EOS token bug on FastChat models (non DPO) #94

Closed natolambert closed 7 months ago

natolambert commented 7 months ago

Closes #90 Effected models to re-run (prev score --> new score):

These had a stronger effect on Tulu models, interestingly.