OpenGVLab / Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
https://vchat.opengvlab.com/
MIT License
2.86k stars 230 forks source link

video data #98

Closed kuaileqipaoshui closed 5 months ago

kuaileqipaoshui commented 5 months ago

Excuse me, I encountered some difficulties when I tried to download the webvid-10M dataset. After downloading the csv file, I download the original video in the way he said. it work, but there is nothing in the folder. Is that normal? Or is there any way to handle .csv and .tsv files ? Are downloaded .csv and .tsv files pre-processed into source videos and source pictures?

videochat6 videochat10 videochat11
yinanhe commented 5 months ago

Excuse me, I encountered some difficulties when I tried to download the webvid-10M dataset. After downloading the csv file, I download the original video in the way he said. it work, but there is nothing in the folder. Is that normal? Or is there any way to handle .csv and .tsv files ? Are downloaded .csv and .tsv files pre-processed into source videos and source pictures? videochat6 videochat10 videochat11

I'm sorry for the delayed response. If you have downloaded the original annotation files for webvid, the CSV should contain the download URLs for each video. You can use tools like wget or request to download them, and each folder should have the respective videos, which should not be empty. For specific download details, you can ask in the original repository. The original videos have not been processed into frames or any other preprocessing.

yinanhe commented 5 months ago

I also have a question. What is the difference between webvid and webvid_10m? videochat8 And how can I download cc3m and sbu data? videochat9

I also have a question. What is the difference between webvid and webvid_10m? videochat8 And how can I download cc3m and sbu data? videochat9

webvid refers to WebVid2M, which is a subset of 2.5M. For more details, please refer to WebVid. You can find the CC dataset on GitHub, and the SBU dataset on GitHub.