issues
search
allenai
/
reward-bench
RewardBench: the first evaluation tool for reward models.
https://huggingface.co/spaces/allenai/reward-bench
Apache License 2.0
440
stars
52
forks
source link
Add name substitution to benchmark results
#64
Closed
ljvmiranda921
closed
8 months ago