New model | Cohere Aya Expanse

TIGER-AI-Lab / MMLU-Pro

The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]

Apache License 2.0

121 stars 21 forks source link

Closed NSbuilder closed 2 days ago

NSbuilder commented 1 week ago

Wyyyb commented 2 days ago

Thanks for your suggestion. We have added their evaluation results to our leaderboard.