TIGER-AI-Lab / MMLU-Pro

The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]
Apache License 2.0

New model | Yi - Lightning #36

Closed NSbuilder closed 1 month ago

NSbuilder commented 1 month ago

Seems like a new Chinese SOTA MoE model, the successor to Yi-Large from 01.AI.

https://x.com/Senseye_Winning/status/1845777940338143586

Top performance on lmarena.ai (especially in math and coding, apart from Sonnet 3.5).

Seems to beat Qwen 2.5 72B.

GLM-4-Plus and Yi-Lightning-lite are also new and impressive.

wenhuchen commented 1 month ago

Already added to the leaderboard. Thanks for the suggestion.