onejune2018 / Awesome-LLM-Eval

Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具、基准/数据、演示、排行榜和大模型等组成的精选列表,主要面向基础大模型评测,旨在探求生成式AI的技术边界.
MIT License
396 stars 39 forks source link

Add MMLU-Pro #8

Open hobbytp opened 3 months ago

hobbytp commented 3 months ago

MMLU-Pro info is as below: 1.paper: https://arxiv.org/pdf/2406.01574 2.huggingface: https://huggingface.co/spaces/TIGER-Lab/MMLU-Pro 3.GitHub: https://github.com/TIGER-AI-Lab/MMLU-Pro

onejune2018 commented 2 months ago

MMLU-Pro info is as below: 1.paper: https://arxiv.org/pdf/2406.01574 2.huggingface: https://huggingface.co/spaces/TIGER-Lab/MMLU-Pro 3.GitHub: https://github.com/TIGER-AI-Lab/MMLU-Pro

Thanks for your comments, we will add MMLU-Pro : )