YehLi / xmodaler

X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).

msvd feature #56

Open codingforwhat opened 1 year ago

codingforwhat commented 1 year ago

Why can't I download the msvd_dataset features? The folder is empty and doesn't contain any .npy files. Thanks for your answer.

JingyuLi-code commented 4 months ago

Were you able to download the relevant msvd_dataset features? If so, could you share the link?