can you provide the image subsets you used for evaluation because the whole set of gqa and coco is so large

DAMO-NLP-SG / VCD

[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Apache License 2.0

196 stars 9 forks source link

Closed yfzhang114 closed 9 months ago

LengSicong commented 9 months ago

Thanks for your interest.

Unfortunately, we also downloaded the whole dataset and read images from that. You can find all image ids used in COCO and GQA here.