showlab / all-in-one

[CVPR2023] All in One: Exploring Unified Video-Language Pre-training
https://arxiv.org/abs/2203.07303

Annotation (Caption file) missing for HowTo100M dataset #6

Closed jacobswan1 closed 2 years ago

jacobswan1 commented 2 years ago

Dear author,

Thank you for your great effort, especially the very neat and organized data-loading code! It helps our research a lot. Meanwhile, I'm wondering whether you plan to release the processed caption file for the HowTo100M dataset, as it does not seem to be available on the Google Drive. I understand that you probably use the officially provided caption file, but I wanted to check with you first, since your processed file might fit the data loader more easily.

Appreciate your help in advance.

Regards

FingerRec commented 2 years ago

Hi jacobswan1,

We downloaded the caption files from https://www.rocq.inria.fr/cluster-willow/amiech/howto100m/howto100m_captions.zip. We decompress this archive into the same directory as the original videos and load the captions as shown below (lines 119-121 of howto100m.py):

    def get_caption_path(self, sample):
        # example xx/xx/xx.mp4 -> xx.csv
        return os.path.join(self.caption_dir, sample.split('/')[-1].split('.')[0] + '.csv')

If you have difficulty downloading HowTo100M, you can refer to MIL-NCE for more information.
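
To illustrate the caption lookup in the snippet above, here is a minimal standalone sketch. It is not code from the repository: `get_caption_path` is rewritten as a free function that takes `caption_dir` as an argument, `load_caption_rows` is a hypothetical helper, and the `start`/`end`/`text` column names are an assumption about the CSV layout that should be checked against a real caption file.

```python
import csv
import os
import tempfile


def get_caption_path(caption_dir, sample):
    # Same mapping as the snippet above: "xx/xx/xx.mp4" -> "<caption_dir>/xx.csv"
    video_id = sample.split('/')[-1].split('.')[0]
    return os.path.join(caption_dir, video_id + '.csv')


def load_caption_rows(caption_path):
    # Hypothetical helper: reads one caption CSV into a list of dicts,
    # assuming the first row of the file is a header.
    with open(caption_path, newline='', encoding='utf-8') as f:
        return list(csv.DictReader(f))


if __name__ == '__main__':
    # Build a tiny fake caption file so the sketch runs without the dataset.
    with tempfile.TemporaryDirectory() as caption_dir:
        path = get_caption_path(caption_dir, 'videos/part_0/vid123.mp4')
        with open(path, 'w', newline='', encoding='utf-8') as f:
            # Assumed column names; verify against the extracted files.
            f.write('start,end,text\n0.0,1.5,hello\n')
        for row in load_caption_rows(path):
            print(row['start'], row['end'], row['text'])
```

Because the mapping keeps only the file stem, every video's CSV has to sit directly inside `caption_dir`, whatever subdirectory the video itself lives in.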

jacobswan1 commented 2 years ago

Thank you for the instruction! I'll follow as you suggested.

Yours,
Jacob