video data - Githubissues

kuaileqipaoshui commented 5 months ago

Excuse me, I ran into some difficulties when trying to download the WebVid-10M dataset. After downloading the CSV file, I downloaded the original videos following the instructions given. The download ran, but the folder is empty. Is that normal? Is there a way to handle the .csv and .tsv files? Are the downloaded .csv and .tsv files supposed to be pre-processed into source videos and source images?

yinanhe commented 5 months ago

I'm sorry for the delayed response. If you downloaded the original annotation files for WebVid, the CSV should contain the download URL for each video. You can use a tool such as wget or the Python requests library to download them; each folder should then contain the corresponding videos and should not be empty. For specifics of the download process, please ask in the original repository. The original videos are used as-is: they have not been split into frames or preprocessed in any other way.
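For anyone hitting the same empty-folder problem, here is a minimal sketch of the download step described above. It is not the official WebVid download script. It assumes the annotation CSV has a URL column named `contentUrl` and an ID column named `videoid`; check the header of your own file and pass different column names if they differ. The function name `download_videos` and its parameters are made up for this illustration. It uses only the Python standard library and collects failures instead of skipping them silently, which should make it easier to see why a folder ends up empty.

```python
import csv
import urllib.request
from pathlib import Path


def fetch_url(url: str, timeout: float = 30.0) -> bytes:
    """Download one URL and return its raw bytes."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read()


def download_videos(csv_path, out_dir, url_col="contentUrl", id_col="videoid",
                    fetch=fetch_url, limit=None):
    """Download each video listed in the CSV.

    Column names are assumptions about the annotation file; adjust as needed.
    Returns (saved, failed): saved is a list of file paths, failed is a list
    of (url, error message) pairs.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    saved, failed = [], []
    with open(csv_path, newline="", encoding="utf-8") as f:
        for i, row in enumerate(csv.DictReader(f)):
            if limit is not None and i >= limit:
                break
            target = out_dir / f"{row[id_col]}.mp4"
            # Skip files that were already downloaded in an earlier run.
            if target.exists() and target.stat().st_size > 0:
                saved.append(target)
                continue
            try:
                data = fetch(row[url_col])
                if not data:
                    raise ValueError("empty response")
                target.write_bytes(data)
                saved.append(target)
            except Exception as exc:
                # Record the failure so it can be inspected afterwards.
                failed.append((row[url_col], repr(exc)))
    return saved, failed
```

A quick trial on the first few rows could look like `saved, failed = download_videos("annotations.csv", "videos", limit=10)`, where `annotations.csv` is a placeholder for your own file. If `saved` is empty, printing `failed` shows the error for each URL.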

yinanhe commented 5 months ago

I also have a question. What is the difference between webvid and webvid_10m? And how can I download cc3m and sbu data?

webvid refers to WebVid2M, a subset of WebVid-10M containing about 2.5M videos. For more details, please refer to the WebVid repository. The CC dataset and the SBU dataset can each be found on GitHub.

OpenGVLab / Ask-Anything

video data #98