TIGER-AI-Lab / MMLU-Pro

The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]
Apache License 2.0

New model | Cohere Aya Expanse #37

Closed: NSbuilder closed this issue 2 days ago

NSbuilder commented 1 week ago

https://cohere.com/blog/aya-expanse-connecting-our-world

Wyyyb commented 2 days ago

Thanks for your suggestion. We have added their evaluation results to our leaderboard.