open-compass / opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
https://opencompass.org.cn/
Apache License 2.0
4.18k stars 446 forks source link

[Feature] Update Math data #1700

Closed MaiziXiao closed 1 week ago

MaiziXiao commented 1 week ago
  1. Add test_prm800k_500.json subset into auto-download OSS
  2. Update Math Dataset config to support different file_name