open-compass / VLMEvalKit

Open-source evaluation toolkit for large vision-language models (LVLMs), supporting 160+ VLMs and 50+ benchmarks
https://huggingface.co/spaces/opencompass/open_vlm_leaderboard
Apache License 2.0
1.34k stars, 188 forks

amber benchmark #475

Closed: yfzhang114 closed this 1 month ago

kennymckormick commented 1 week ago

@yfzhang114

Please confirm that you can run this benchmark properly with at least one model. It seems the TSV format is not OK.
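
For anyone hitting the same problem, here is a minimal sketch of a structural sanity check that could be run on a benchmark TSV before testing it with a model. It is not part of VLMEvalKit: the function name `check_tsv` is hypothetical, and the column names in `REQUIRED_COLUMNS` are assumptions for illustration, since this thread does not state which columns the toolkit expects.

```python
import csv
import io

# Assumed column names for illustration only; the schema the toolkit
# actually requires is not stated in this thread.
REQUIRED_COLUMNS = ("index", "question", "answer")


def check_tsv(text, required=REQUIRED_COLUMNS):
    """Return a list of human-readable problems found in TSV text."""
    reader = csv.reader(io.StringIO(text), delimiter="\t")
    try:
        header = next(reader)
    except StopIteration:
        return ["file is empty"]

    problems = []

    # Every required column must appear in the header row.
    missing = [name for name in required if name not in header]
    if missing:
        problems.append(f"missing columns: {missing}")

    # Every data row must have the same number of fields as the header.
    for row_no, row in enumerate(reader, start=2):
        if len(row) != len(header):
            problems.append(
                f"row {row_no}: expected {len(header)} fields, got {len(row)}"
            )
    return problems
```

To use it, read the file as text (for example `open(path, newline="", encoding="utf-8").read()`) and pass the result to `check_tsv`. An empty list means only that the file is structurally consistent under the assumed schema; it does not confirm that the benchmark runs end to end with a model.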