allenai / reward-bench

RewardBench: the first evaluation tool for reward models.
https://huggingface.co/spaces/allenai/reward-bench
Apache License 2.0
442 stars 52 forks source link

Improve run_generative documentation + add to pip #124

Closed natolambert closed 6 months ago

natolambert commented 6 months ago

Closes #122, will need to do a new release to make it available.