open-compass / opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
https://opencompass.org.cn/
Apache License 2.0
4.18k stars 446 forks source link

Add RULER 64k #1709

Closed changlan closed 22 hours ago

changlan commented 3 days ago

TESTED: opencompass --datasets ruler_64k_gen --hf-type base --hf-path facebook/opt-125m