huggingface / lighteval

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
MIT License
832 stars 99 forks source link

refacto judge and add mixeval #337

Closed NathanHB closed 1 month ago

NathanHB commented 1 month ago

What this PR does: