allenai / reward-bench

RewardBench: the first evaluation tool for reward models.
https://huggingface.co/spaces/allenai/reward-bench
Apache License 2.0
440 stars 52 forks source link

Set up OpenRouter for llm-as-a-judge #130

Closed natolambert closed 5 months ago

natolambert commented 6 months ago

Need to get legal approval to use Gemini API, but try this https://openrouter.ai/docs#quick-start

natolambert commented 5 months ago

Closing this, didn't realize it is also a paid service. Would require approvals.