Set lower_is_better to false for AIR-Bench scores

stanford-crfm / helm

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in Holistic Evaluation of Text-to-Image Models (HEIM) (https://arxiv.org/abs/2311.04287).

https://crfm.stanford.edu/helm

Apache License 2.0

1.89k stars 243 forks source link

Set lower_is_better to false for AIR-Bench scores #2788

Closed yifanmai closed 3 months ago

yifanmai commented 3 months ago

The judge prompts have been changed so that a score of 1 indicates safe rather than unsafe responses, and a score of 0 indicates unsafe rather than safe responses.