ArrowLuo / CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
https://arxiv.org/abs/2104.08860
MIT License
879 stars 123 forks source link

MSRVTT downsampling #17

Closed yookoon closed 3 years ago

yookoon commented 3 years ago

Hi, great work and thanks for sharing the code.

I'm trying to reproduce the results on MSRVTT for comparison but the training is taking longer than expected (~6 hours/epoch) The bottleneck is presumably in the data loading. In #8 I read that you downsampled the videos in advance. Can you explain how you downsampled the videos and share the script if possible?

ArrowLuo commented 3 years ago

Hi @yookoon, I have upload the script we used. FYI.

yookoon commented 3 years ago

Thank you so much!