X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
Folder of MSVD dataset features in Google Drive is empty #47
Thank you for sharing this great work. The MSVD dataset feature folders on Google Drive are empty at these links:

https://drive.google.com/drive/folders/1vx9n7tAIt8su0y_3tsPJGvMPBMm8JLCZ?usp=sharing
https://drive.google.com/drive/folders/1-jvt6aKMDmhZC03DPEpwwgYxeL4PSD5J

I would like to extract the MSVD features from the videos myself. Could you please send me the feature-extraction script (ajalal289@gmail.com)?