Closed ikodoh closed 8 months ago
Please check here https://github.com/OpenGVLab/unmasked_teacher/issues/12#issuecomment-1723248511. I have uploaded the test files we used.
Thanks for uploading the test files.
However, in the test file, one video comes with multiple sentences, and I still don't know how to handle those multiple sentences during training and testing. Can you provide further implementation details?
Thanks for the response.
As I understand it, each of the multiple sentences for a video is treated as an individual sample. This causes no problem in training, but I think each video has to be matched to a single sentence during testing, since one sentence has to be retrieved per video. How did you deal with this?
In my opinion, during testing, it is like a question with multiple answers.
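To make that "multiple answers" reading concrete, here is a minimal sketch of how recall could be scored under it. This is only an illustration of one common convention, not the confirmed evaluation code of this repository: the function name `recall_at_k`, the `sim` score matrix, and the `txt2vid` mapping are all hypothetical. It assumes every sentence is scored against every video, that text-to-video has exactly one correct video per sentence, and that video-to-text counts a hit if any of the video's sentences ranks within the top k.

```python
def recall_at_k(sim, txt2vid, ks=(1, 5, 10)):
    """Recall@k when one video has several ground-truth sentences.

    sim[t][v]  : similarity between sentence t and video v (hypothetical input)
    txt2vid[t] : index of the video that sentence t describes
    Assumes every video has at least one sentence. Ties are resolved
    optimistically (only strictly higher scores push the rank down).
    """
    num_texts = len(sim)
    num_videos = len(sim[0])

    # Text-to-video: each sentence has exactly one correct video.
    t2v_ranks = []
    for t in range(num_texts):
        target = txt2vid[t]
        rank = sum(1 for v in range(num_videos) if sim[t][v] > sim[t][target])
        t2v_ranks.append(rank)

    # Video-to-text: a video has several correct sentences, so use the
    # rank of its best-scoring correct sentence ("multiple answers").
    v2t_ranks = []
    for v in range(num_videos):
        correct = [t for t in range(num_texts) if txt2vid[t] == v]
        best = max(sim[t][v] for t in correct)
        rank = sum(1 for t in range(num_texts) if sim[t][v] > best)
        v2t_ranks.append(rank)

    results = {}
    for k in ks:
        results[f"t2v_R@{k}"] = 100.0 * sum(r < k for r in t2v_ranks) / num_texts
        results[f"v2t_R@{k}"] = 100.0 * sum(r < k for r in v2t_ranks) / num_videos
    return results


# Toy example: 4 sentences, 2 videos, two sentences per video.
sim = [
    [0.9, 0.1],  # sentence 0 (video 0)
    [0.2, 0.8],  # sentence 1 (video 0)
    [0.3, 0.7],  # sentence 2 (video 1)
    [0.6, 0.4],  # sentence 3 (video 1)
]
print(recall_at_k(sim, txt2vid=[0, 0, 1, 1], ks=(1, 2)))
```

Under this convention no single sentence has to be picked per video at test time; selecting one sentence per video instead would be a different protocol and would generally give different numbers.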
Hi, I'd like to cite your work as a baseline for our project. Can you provide me with the updated results for text-to-video retrieval and video-to-text retrieval on MSVD? The current results are extremely high.
Hi! I have updated some results in MODEL_ZOO. The b_17M is still running.
All the results have been updated.
Thank you for sharing the results.
Thank you for sharing this great work.
I'm trying to reproduce the MSVD retrieval results, but I have a few minor questions. How did you handle multiple sentences per video in training and testing, respectively? I think each sentence can be treated as an individual sample during training, but one sentence has to be selected per video during testing. Can you share your protocol for handling this?
Thanks again for your great work.