Closed clefourrier closed 3 months ago
Let's now use this implementation from the authors (ported by @lewtun ): https://huggingface.co/datasets/google/IFEval
I also updated the one in the Eleuther harness for consistency: https://github.com/EleutherAI/lm-evaluation-harness/pull/2218
Evaluation short description
Let's now use this implementation from the authors (ported by @lewtun ): https://huggingface.co/datasets/google/IFEval