[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
196
stars
9
forks
source link
can you provide the image subsets you used for evaluation because the whole set of gqa and coco is so large #3
Closed
yfzhang114 closed 9 months ago
Thanks for your interest.
Unfortunately, we also downloaded the whole dataset and read images from that. You can find all image ids used in COCO and GQA here.