Closed alphadadajuju closed 3 years ago
@alphadadajuju yes you are right. To calculte f-mAP you need ground truths, such that you can calculate the IoU with ground truths and predictions. Therefore, we have calculated f-mAP where we have annottions. However, for video-mAP we have used untrimmed videos.
For AVA dataset, only frame-mAP is used for evaluation and annotations are provided for sampling rate of 1 Hz.
Thank you for your clarifications!
Thank you for sharing your amazing work (I see that you even verified YOWO's performance on AVA recently)!
I do have some questions related to how you evaluated UCF-24's frame mAP, and was hoping you could help me clarify.
(It may be the case that I didn't fully grasp your code ... please don't hesitate to correct me if I made a mistake!) Thank you again.