Closed Modas-Li closed 1 year ago
please update and support this~
maybe you can try evaluating through lm-evaluation-harness?
https://github.com/hkust-nlp/ceval#use-through-evaluation-harness
please update and support this~