open-compass / opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
https://opencompass.org.cn/
Apache License 2.0
4.18k stars 446 forks source link

[CI] update torch version and add more datasets into daily testcase #1701

Closed zhulinJulia24 closed 5 days ago

zhulinJulia24 commented 1 week ago
  1. add fullbench v1.2 and v1.3' s dataset
  2. update torch version becuase of vllm's upgrade