open-compass / VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks
https://huggingface.co/spaces/opencompass/open_vlm_leaderboard
Apache License 2.0
1.34k stars 188 forks source link

Reproducing QWen2VL Results on Video Benchmarks with VLMEvalKit #484

Open aniki-ly opened 1 month ago

aniki-ly commented 1 month ago

Thanks for the evaluation toolkit. Could you provide scripts to reproduce QWen2VL results on Video Benchmarks?

luohao123 commented 1 month ago

It actually can not reproduced...

kennymckormick commented 1 month ago

@aniki-ly @luohao123

Qwen-VL Team is working on fixing the issues.

jun0yayay commented 3 weeks ago

@aniki-ly @luohao123

Qwen-VL Team is working on fixing the issues.

hi, now this code can do that?

kennymckormick commented 1 week ago

@FangXinyu-0913 Has this issue already been solved?