hkust-nlp / ceval

Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
https://cevalbenchmark.com/
MIT License
1.64k stars 78 forks source link

how to evaluate models trained by bloom serires base model? #48

Closed Modas-Li closed 1 year ago

Modas-Li commented 1 year ago

please update and support this~

jxhe commented 1 year ago

maybe you can try evaluating through lm-evaluation-harness?

https://github.com/hkust-nlp/ceval#use-through-evaluation-harness