open-compass / VLMEvalKit

Open-source evaluation toolkit for large vision-language models (LVLMs), supporting ~100 VLMs and 40+ benchmarks
https://huggingface.co/spaces/opencompass/open_vlm_leaderboard
Apache License 2.0
1.08k stars 154 forks

Upload a new benchmark #415

Closed (ttguoguo3 closed 3 weeks ago)

ttguoguo3 commented 1 month ago

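I have stored the TSV-format data for a new benchmark, MM-NIAH, at https://huggingface.co/datasets/petter12321/MM-NIAH-VLMEvalKit/tree/main. The MM_NIAH_VAL.tsv file is complete, while the MM_NIAH_TEST.tsv file has been split into five files (part-aa~e). Please combine these five files into MM_NIAH_TEST.tsv and then upload both TSV files to the server. Thank you!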
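For anyone reproducing the merge step locally, here is a minimal Python sketch of how the split parts could be concatenated. It assumes the parts were produced by a byte-wise split (for example with the `split` utility), so that joining them in lexicographic order restores the original file. The function name `combine_parts` and the glob pattern are hypothetical; the exact part filenames in the dataset repository are not given above, so adjust the pattern to match them.

```python
import shutil
from pathlib import Path


def combine_parts(part_dir, pattern, output_path):
    """Concatenate split file parts, in lexicographic order, into one file.

    `pattern` is a glob such as "*part-a[a-e]" (an assumed naming scheme;
    check the real filenames before running).
    Returns the part filenames in the order they were joined.
    """
    parts = sorted(Path(part_dir).glob(pattern))
    if not parts:
        raise FileNotFoundError(f"no files match {pattern!r} in {part_dir}")
    # Copy raw bytes so the TSV content is reproduced without re-encoding.
    with open(output_path, "wb") as out:
        for part in parts:
            with open(part, "rb") as src:
                shutil.copyfileobj(src, out)
    return [p.name for p in parts]
```

A call such as `combine_parts("downloads", "*part-a[a-e]", "MM_NIAH_TEST.tsv")` would then write the merged file, where `downloads` is a placeholder for wherever the parts were saved. Make sure the output filename does not itself match the pattern, otherwise a rerun would fold the merged file back into the result.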

kennymckormick commented 3 weeks ago

Working.