OpenGVLab / MMT-Bench

ICML'2024 | MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI

How to parse out single images in each sample? #2

Open gd2016229035 opened 3 months ago

gd2016229035 commented 3 months ago

Great job! We found that the 'image' field in MMT-Bench_VAL.tsv may contain multiple images per sample. How can we separate them and parse out the individual images?

KainingYing commented 3 months ago

Thanks for your attention!

We followed the VLMEvalKit conventions to build MMT-Bench_VAL.tsv.


For details, refer to https://github.com/open-compass/VLMEvalKit/blob/main/Development.md.
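
As a rough illustration, here is a minimal sketch of splitting one `image` cell into individual images. It rests on an assumption that should be checked against the linked Development.md: that a single-image cell holds one base64 string, while a multi-image cell holds the string form of a list of base64 strings (for example `"['<b64_1>', '<b64_2>']"`). The helper name `split_image_field` is hypothetical and is not part of VLMEvalKit or MMT-Bench.

```python
import ast
import base64
from typing import List


def split_image_field(field: str) -> List[bytes]:
    """Return the raw bytes of every image stored in one `image` cell.

    Assumption (not confirmed in this thread): a multi-image cell is the
    string form of a Python list of base64 strings, and a single-image
    cell is one bare base64 string.
    """
    text = field.strip()
    if text.startswith("[") and text.endswith("]"):
        # Assumed multi-image layout. The base64 alphabet contains no
        # brackets, so this check cannot misfire on a single image.
        encoded = ast.literal_eval(text)
    else:
        # Assumed single-image layout.
        encoded = [text]
    # Decode each base64 string into the original image file bytes.
    return [base64.b64decode(item) for item in encoded]
```

Each returned `bytes` object should be the content of one image file, so it can be written to disk as-is or opened with Pillow via `Image.open(io.BytesIO(data))`. `ast.literal_eval` is used rather than `eval` so that the cell is only ever parsed as a literal.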