allenai / reward-bench

RewardBench: the first evaluation tool for reward models.
https://huggingface.co/spaces/allenai/reward-bench
Apache License 2.0
442 stars, 52 forks

[Core team] Migrate Prior Sets to 50% weight #87

natolambert closed 7 months ago

natolambert commented 8 months ago

Fixes:

cc @ljvmiranda921

natolambert commented 7 months ago

Closed with #105