tjunlp-lab / Awesome-LLMs-Evaluation-Papers

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

How many shots are used to evaluate the benchmarks in OpenEval? #23

Open zhimin-z opened 6 months ago

zhimin-z commented 6 months ago

I can't find this information anywhere.