CLUEbenchmark / SuperCLUE

SuperCLUE: A Comprehensive Benchmark for General-Purpose Chinese Foundation Models
https://www.superclueai.com
3.02k stars, 97 forks

As a benchmark leaderboard, consider following Chinese-LLaMA-Alpaca in documenting and publishing a reasonable level of evaluation detail #9

Open shm007g opened 1 year ago

shm007g commented 1 year ago

For reference, see https://github.com/ymcui/Chinese-LLaMA-Alpaca/blob/main/examples/README.md

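When Wang Ziru reviewed the Smartisan phone, he said they had tested more than 1,000 items but could not publicly disclose what each item actually tested...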

brightmart commented 1 year ago

Thanks for the feedback. We will add this shortly.