allenai/reward-bench
RewardBench: the first evaluation tool for reward models.
https://huggingface.co/spaces/allenai/reward-bench
Apache License 2.0
442 stars, 52 forks
Plot distribution of RM scores for each RM #67
Closed
natolambert closed 8 months ago