Saving bug (non breaking)

allenai / reward-bench

RewardBench: the first evaluation tool for reward models.

Apache License 2.0

281 stars 28 forks source link

Closed natolambert closed 3 months ago

natolambert commented 3 months ago

We don't use our own sub_path correctly in saving results. It works, but is confusing. See:

Tbh i'm surprised the code still works as expected (lol)