huggingface / lighteval

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
MIT License
467 stars 54 forks

adding aimo custom eval #154

Closed NathanHB closed 2 months ago

NathanHB commented 2 months ago

@lewtun Do you still need this / do we want to merge it or put it in the official tasks?

lewtun commented 2 months ago

> @lewtun Do you still need this / do we want to merge it or put it in the official tasks?

Left some final nits and then I think it can be merged!