how to evaluate models trained by bloom serires base model?

hkust-nlp / ceval

Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

https://cevalbenchmark.com/

MIT License

1.64k stars 78 forks source link

Closed Modas-Li closed 1 year ago

Modas-Li commented 1 year ago

please update and support this~

jxhe commented 1 year ago

maybe you can try evaluating through lm-evaluation-harness?