Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in Holistic Evaluation of Text-to-Image Models (HEIM) (https://arxiv.org/abs/2311.04287).
Switch data storage of ThaiExam to HF.
It's now linked to ThaiExam v1 (e.g., all splits have 5 examples for in-context learning & fixed broken examples).