KunpengLi1994 / VSRN

PyTorch code for ICCV'19 paper "Visual Semantic Reasoning for Image-Text Matching"

Question about the gain in the paper. #6

Closed yang-yang-ai closed 4 years ago

yang-yang-ai commented 4 years ago

In the paper, you said you improve 5.8% on caption retrieval and 12.6% on image retrieval: "our VSRN improves 5.8% on caption retrieval (R@1) and 12.6% on image retrieval (R@1) relatively (following the same strategy [23] of averaging predicted similarity scores of two trained models)." But the numbers in Table 3 are not consistent with this claim:

| Method | Caption R@1 | Caption R@5 | Caption R@10 | Image R@1 | Image R@5 | Image R@10 |
|---|---|---|---|---|---|---|
| SCAN (ECCV'18) [23] | 67.4 | 90.3 | 95.8 | 48.6 | 77.7 | 85.2 |
| VSRN (ours) | 71.3 | 90.6 | 96.0 | 54.7 | 81.8 | 88.2 |

Your results are only 3.9% better on caption retrieval (R@1) and 6.1% better on image retrieval (R@1). Could you tell me how you got these numbers?

Thanks.

KunpengLi1994 commented 4 years ago

Hi,

Please note that we follow SCAN (ECCV'18) to report the relative improvement, which is calculated by (P_our - P_sota)/P_sota. P_our is the performance of the proposed method and P_sota is the state-of-the-art performance (like SCAN). This is the same way how SCAN calculate the relative improvement reported in their abstract.