ttengwang / PDVC

End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)
MIT License
200 stars 23 forks source link

Some questions regarding the dataset #39

Closed MrAccelerator closed 1 year ago

MrAccelerator commented 1 year ago

I would like to ask a question about the dataset. I looked at some papers, and I found that some papers are testing the modal performance on the ActivityNet Captions validation set, while some papers are testing the modal performance on the ActivityNet Captions test split. Is there the difference between ActivityNet Captions validation set and ActivityNet Captions test split?

ttengwang commented 1 year ago

The validation set and test set are both from the official splits[1] but the annotations of the test set is held by the ActivityNet Challenge organizers. Some early papers report performance on test split by evaluating on the online test server. However, the online server is only accessible during the Challenge, so some recent papers report the results on validation set as a compromise.

[1] R. Krishna, K. Hata, F. Ren, L. Fei-Fei, and J. C. Niebles, “Dense-captioning events in videos,” in Proc. IEEE Int. Conf. Comput. Vis., 2017, pp. 706-715.

MrAccelerator commented 1 year ago

Thanks for your quick response, this solved my confusion.