MSVD-Indonesian is derived from the MSVD (Microsoft Video Description) dataset, which is obtained with the help of a machine translation service (Google Translate API). This dataset can be used for multimodal video-text tasks, including text-to-video retrieval, video-to-text retrieval, and video captioning. Same as the original English dataset, the MSVD-Indonesian dataset contains about 80k video-text pairs.
Dataloader name:
id_msvd/id_msvd.py
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?id_msvd