huggingface / lighteval

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
MIT License
467 stars 54 forks source link

Add AGIEval #121

Closed clefourrier closed 3 months ago

clefourrier commented 3 months ago

Compared results with YALL, within stderr range. Closes #79