Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in Holistic Evaluation of Text-to-Image Models (HEIM) (https://arxiv.org/abs/2311.04287).
Hi folks, thanks for the work here! I'm trying to use the `HEIMHumanEvalScenario` class but run into an error after `get_instances()` is called. Specifically, https://worksheets.codalab.org/rest/bundles/0x502d646c366c4f1d8c4a2ccf163b958f/contents/blob/ returns `ERROR 403: Forbidden` when `wget` is used to download data during the `HEIMHumanEvalScenario.get_instances()` call: https://github.com/stanford-crfm/helm/blob/60a58658b4648d7b027869a6b22fec1af2eaa855/src/helm/benchmark/scenarios/vision_language/heim_human_eval_scenario.py#L62

Visiting the URL manually shows a forbidden error specifying that read permissions are needed. Is this intended behavior?
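For anyone who wants to check the download outside of HELM, here is a minimal sketch using only the Python standard library. `probe_url` is a hypothetical helper written for this report and is not part of HELM; the URL is the one quoted above. It uses a plain GET request, which should be roughly what `wget` does, though the server may treat the two clients differently.

```python
import urllib.error
import urllib.request

# URL quoted in the report above.
DATA_URL = (
    "https://worksheets.codalab.org/rest/bundles/"
    "0x502d646c366c4f1d8c4a2ccf163b958f/contents/blob/"
)


def probe_url(url, opener=urllib.request.urlopen, timeout=30):
    """Return the HTTP status code for `url`, or None if no response was received.

    `opener` is injectable (hypothetical convenience) so the function can be
    exercised without network access.
    """
    request = urllib.request.Request(url)
    try:
        with opener(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status such as 403.
        return err.code
    except urllib.error.URLError:
        # DNS failure, refused connection, no network, etc.
        return None
```

Calling `probe_url(DATA_URL)` returns the status code the server sends back, so a result of `403` would match the `wget` failure described above, while `None` would point to a connectivity problem rather than a permissions one.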