stanford-crfm / helm

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in Holistic Evaluation of Text-to-Image Models (HEIM) (https://arxiv.org/abs/2311.04287).
https://crfm.stanford.edu/helm
Apache License 2.0
1.89k stars 243 forks source link

Are we able to feed HELM a table of LLM input/output instead of connecting to a model? #2779

Closed mzahorec closed 3 months ago

mzahorec commented 3 months ago

Hello, I have spreadsheets of input/output from language models. Is there any relatively straightforward way to run HELM metrics on data in this format? (That is, instead of connecting HELM to a language model and grabbing the output via API.) I was not able to find any mentions of this sort of task in any of the documentation or previous issues. Thank you.

yifanmai commented 3 months ago

This use case is currently not officially supported, unfortunately.

You consider try one of these approaches, which are not officially supported:

mzahorec commented 3 months ago

I will try out these approaches. Thank you for your help!! @yifanmai