Hello there,
I have a question about Table 2. Is AP in Table 2 the accuracy with which each metric (ALOHa, CLIPScore) assigns a higher score to the non-FOIL caption than to the FOIL caption, following the procedure in Section 4.4 of the CLIPScore paper (https://arxiv.org/pdf/2104.08718)?
(Section 4.4 of "CLIPScore: A Reference-free Evaluation Metric for Image Captioning")
"we sample a (FOIL, true) pair, and compute the accuracy of each evaluation metric in their capacity to assign a higher score to the true candidate versus the FOIL."
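
To make my reading of that sentence concrete, here is a minimal sketch of the pairwise accuracy I have in mind. The function name `pairwise_accuracy`, the example scores, and the choice to count ties as errors are my own assumptions for illustration; they are not taken from either paper's code.

```python
from typing import Sequence, Tuple


def pairwise_accuracy(pairs: Sequence[Tuple[float, float]]) -> float:
    """Fraction of (FOIL, true) pairs where the true caption scores higher.

    Each element of `pairs` is (foil_score, true_score) from one metric.
    Assumption: a tie counts as a miss, since the metric did not prefer
    the true caption.
    """
    if not pairs:
        raise ValueError("need at least one (FOIL, true) pair")
    correct = sum(1 for foil_score, true_score in pairs if true_score > foil_score)
    return correct / len(pairs)


# Made-up scores, only to show the call shape: (foil_score, true_score).
example_scores = [(0.61, 0.74), (0.58, 0.55), (0.40, 0.66), (0.70, 0.70)]
print(pairwise_accuracy(example_scores))  # 2 of 4 pairs are ranked correctly
```

If AP in Table 2 is instead average precision over all captions, it would be computed differently from this, which is the point I would like to confirm.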