Closed jiminbot20 closed 4 years ago
If you are asking how each metrics works, e.g. BLEU, SPICE, METOR ..., then you probably want to read corresponding papers. If you are asking how we are computing the metrics using a prediction "pred" and five captions "cap_1, ..., cap_5", then we are using all five captions as reference. So, use BLEU as an example, the metric would be BLEU(pred, [cap_1, ...., cap_5]). You could see code in coco_caption for more details. There is also a github repository of coco_caption.
is it fine to understand, calculate metrics with all 5 captions as target and then taking average?
보낸 사람: lukewys notifications@github.com 보낸 날짜: 2020년 9월 24일 목요일 오후 2:44 받는 사람: lukewys/dcase_2020_T6 dcase_2020_T6@noreply.github.com 참조: 전지민 jiminbot20@gm.gist.ac.kr; Author author@noreply.github.com 제목: Re: [lukewys/dcase_2020_T6] I cannot understand the measure process. Can you plz explain? (#6)
If you are asking how each meterics works, e.g. BLEU, SPICE, METOR ..., then you probably want to read corresponding papers. If you are asking how we are computing the loss using a prediction "pred" and five captions "cap_1, ..., cap_5", then we are using all five captions as reference. So, use BLEU as example, the loss would be BLEU(pred, [cap_1, ...., cap_5]). You could see code in coco_caption for more details. There is also a github repositery of coco_caption.
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHubhttps://github.com/lukewys/dcase_2020_T6/issues/6#issuecomment-698125827, or unsubscribehttps://github.com/notifications/unsubscribe-auth/APCT4SCV5FWZWW3CRZVJ2RTSHLMDVANCNFSM4RX3L2RQ.
I cannot say yes because I forgot how exactly it works. You could look at the metrics computing code for more details. But one thing for sure, it considers all five captions when computing the metrics.
Thanks I will look into it!
There is 5 captions for 1 audio wav file. Then, when you measure metrics such as BLEU, SPICE etc. what is the target between 5 captions? For example,
In this audio file, what would be the target?
I could not understand even I keep reading papers