iejMac / clip-video-encode

Easily compute clip embeddings from video frames
MIT License
132 stars 19 forks

Questions on Data download #58

Closed Buzz-Beater closed 1 year ago

Buzz-Beater commented 1 year ago

Hi, thanks for the good work. I was trying to use the pre-extracted video features on WebVid10M and was wondering whether there are any instructions for that. I followed the Hugging Face dataset hub instructions but wasn't able to download the data correctly; it reports a "missing metadata" error.

iejMac commented 1 year ago

@Buzz-Beater Hmm, yeah, I should probably write an example of how to use our WebDataset format. We just store the data on Hugging Face, but to use it you need our dataset reader class. First, download the tar files from Hugging Face by cloning the dataset, then run 'git lfs install' inside the cloned dataset repository. At that point you'll have the tar files downloaded, and you can feed them into the reader class, which you can import via from clip_video_encode.dataset import EmbeddingWebDatasetReader. After that you can initialize it with a string like '/path/to/tar/files/{000000000..000001082}.tar' for the val_urls parameter. Double-check how many tars there are; I think I remember 1082, but there may be more or fewer.
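
For the "double-check how many tars there are" step, here is a minimal sketch that builds the brace pattern from whatever is actually on disk. The helper names build_shard_pattern and is_lfs_pointer are hypothetical and not part of clip_video_encode. The sketch assumes the shards are named with zero-padded numbers (like 000000000.tar) and sit directly in one directory. It also flags files that are still Git LFS pointer stubs, which is what a clone can leave behind when the large files were not fetched.

```python
from pathlib import Path

# Git LFS pointer stubs are small text files that begin with this line.
LFS_POINTER_PREFIX = b"version https://git-lfs"


def is_lfs_pointer(path):
    """Return True if the file looks like a Git LFS pointer, not real data."""
    with open(path, "rb") as f:
        return f.read(len(LFS_POINTER_PREFIX)) == LFS_POINTER_PREFIX


def build_shard_pattern(tar_dir):
    """Build a '{first..last}.tar' brace pattern from the shards in tar_dir.

    Hypothetical helper: assumes numerically named shards in a single folder.
    """
    tar_dir = Path(tar_dir)
    shards = sorted(
        (p for p in tar_dir.glob("*.tar") if p.stem.isdigit()),
        key=lambda p: int(p.stem),
    )
    if not shards:
        raise FileNotFoundError(f"no numbered .tar shards found in {tar_dir}")

    # A pointer stub means the actual tar content was never downloaded.
    stubs = [p.name for p in shards if is_lfs_pointer(p)]
    if stubs:
        raise RuntimeError(f"{len(stubs)} shard(s) are LFS pointers, e.g. {stubs[0]}")

    first, last = shards[0].stem, shards[-1].stem
    # The brace range names every number in between, so gaps would point
    # the reader at shards that do not exist.
    expected = int(last) - int(first) + 1
    if expected != len(shards):
        raise ValueError(f"expected {expected} contiguous shards, found {len(shards)}")

    return str(tar_dir / f"{{{first}..{last}}}.tar")
```

The returned string can then be passed as the URL pattern when constructing EmbeddingWebDatasetReader. The reader's other constructor arguments are not covered in this thread, so check the class definition for those.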

Buzz-Beater commented 1 year ago

Got it, thanks for the instructions!