allenai/reward-bench
RewardBench: the first evaluation tool for reward models.
https://huggingface.co/spaces/allenai/reward-bench
Apache License 2.0
281 stars, 28 forks
Initial generative RM implementation (via API) #86
Closed
natolambert closed 3 months ago
natolambert commented 3 months ago
closes #3.
natolambert commented 3 months ago
Waiting on this until I add Claude.