onejune2018 / Awesome-LLM-Eval

Awesome-LLM-Eval: a curated list of tools, datasets/benchmarks, demos, leaderboards, papers, docs, and models, mainly for the evaluation of LLMs. It focuses on evaluating foundation LLMs and aims to explore the technical boundaries of generative AI.
MIT License
396 stars 40 forks

Refactor leaderboard #4

Closed zhimin-z closed 9 months ago

zhimin-z commented 10 months ago

Rather than using static leaderboards that may soon be outdated, we include external links that direct readers to the up-to-date information. @yzbx @onejune2018 @ChanLiang