lezhang7 / Enhance-FineGrained

[CVPR' 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding
Other
43 stars 1 forks source link

recall@5 COCO image-to-text (table 5) #19

Open FiorenzoParascandolo1 opened 1 month ago

FiorenzoParascandolo1 commented 1 month ago

I'm trying to calculate the recall@5 on COCO image-to-text (table 5). I have some questions since I'm not getting the same results:

I tried using the Karpathy split test. I computed the recall@5 considering the recall@5 described above but I get the following results:

CLIP (openai): 77.2 vs 75 in the paper CE-CLIP: 75.5 vs 74.3 in the paper

lezhang7 commented 1 month ago

Hi,

For benchmark coco retreival, we directly use https://github.com/LAION-AI/CLIP_benchmark/tree/main toolkit.