microsoft / XPretrain

Multi-modality pre-training
Other
472 stars 36 forks source link

video caption of HD-VILA-100M Dataset #29

Closed zyyyz closed 1 year ago

zyyyz commented 1 year ago

Thank you for collecting and making public such a large video-text dataset. Is the text description dataset for each video publicly available? Where can we download the text caption of the video?

bei21 commented 1 year ago

Please email me with the address shown on the page. Thanks.