Closed Modas-Li closed 10 months ago
please update and support this~
maybe you can try evaluating through lm-evaluation-harness?
https://github.com/hkust-nlp/ceval#use-through-evaluation-harness
please update and support this~