stanford-crfm / helm

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in Holistic Evaluation of Text-to-Image Models (HEIM) (https://arxiv.org/abs/2311.04287).
https://crfm.stanford.edu/helm
Apache License 2.0
1.8k stars 239 forks source link

Potential Inclusion of the Phi-3 model family #2692

Open bryanzhou008 opened 2 months ago

bryanzhou008 commented 2 months ago

Awesome work, would it be possible to also include the Phi-3 models from Microsoft? Thanks!

yifanmai commented 1 month ago

Thanks for the suggestion! I'll look into adding Phi-3 to the leaderboard.