issues
search
allenai
/
reward-bench
RewardBench: the first evaluation tool for reward models.
https://huggingface.co/spaces/allenai/reward-bench
Apache License 2.0
442
stars
52
forks
source link
[Core team] Migrate Prior Sets to 50% weight
#87
Closed
natolambert
closed
7 months ago
natolambert
commented
8 months ago
Fixes:
[x] Leaderboard loading / display
[x] Repo loading / table printing (for paper)
[x] Replace all column names "Average" with "Score"
[x] Update paper
cc @ljvmiranda921
natolambert
commented
7 months ago
Closed with #105
Fixes:
cc @ljvmiranda921