MLGroupJLU / LLM-eval-survey

The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
https://arxiv.org/abs/2307.03109
1.38k stars 86 forks source link

ARB Benchmark #10

Closed kennethleungty closed 1 year ago

kennethleungty commented 1 year ago

Propose to include this new benchmark for LLMs: https://arb.duckai.org/

MLGroupJLU commented 1 year ago

Thanks for your interest in our paper.

We plan to include it in our survey, and then upload the updated version of the paper to arXiv, looking forward to your attention.

kennethleungty commented 1 year ago

Great to hear, thanks!