open-compass / VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
https://huggingface.co/spaces/opencompass/open_vlm_leaderboard
Apache License 2.0
1.08k stars 154 forks source link

OpenVLM Leaderboard分类问题 #424

Closed bowcr closed 4 weeks ago

bowcr commented 4 weeks ago

MME Leaderboard 的Cognition分类是不是应该对应的reasoning? 代码跑出来结果 其他分类字段都对得上,只有这个不同