Closed hgzjy25 closed 3 years ago
So the Moment Retrieval results presented in table 5 is wrong, R@1 on TVR dataset should be up to 15%.
Hi @hgzjy25,
Thanks! It seems that you are right. We directly adopted this part of the code from the official release of TVRetrieval without further validation. Sorry about that. We have double confirmed with the original authors of TVR and they have it fixed here: https://github.com/jayleicn/TVRetrieval/commit/aec36bc3aa63d22ec5137b1f4a320257caf4d0dd, which will be reflected in HERO shortly.
VCMR should not be affected, as the results are preprocessed first by https://github.com/linjieli222/HERO/blob/f938515424b5f3249fc1d2e7f0373f64112a6529/utils/tvr_eval_utils.py#L132
Thank you for your quick reply and action. But there still remains a mistake. _video_id = e["predictions"][0]
should be _video_id = e["predictions"][0][0]
, otherwise _video_id
will be the first sublist in e["predictions"]
instead of the real video idx.
e['predictions']
inpost_processing_svmr_nms
has four elements[video_idx (int), st (float), ed(float), score (float)]
each sublist, and the functiontemporal_non_maximum_suppression
requires argumentpredictions
with three elements[st (float), ed(float), score (float)]
each sublist. So tstart, tend, tscore will be video_idx, st, ed, respectively. I fix it by myself and test the code. But it seems that this problem have no effect on vcmr results?