huggingface / lighteval

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
MIT License
471 stars 55 forks source link

Add MT-Bench #88

Closed NathanHB closed 3 months ago

NathanHB commented 4 months ago

Adds MT-Bench, will also add llm-as-judge metric and multi turn evals.

Original codebase is here