OpenGVLab / Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
https://vchat.opengvlab.com/
MIT License
2.84k stars 229 forks source link

Json files of pretraining dataset #173

Open qtli opened 1 month ago

qtli commented 1 month ago

I was following DATA.md to download pretraining dataset.

However, I cannot find webvid_10m_train.json, cc12m_train.json, and so on from OpenGVLab/VideoChat2-IT repository. I was wondering how to download these annotation files to place under anno_pretrain/ directory?

qtli commented 1 month ago

Are there any kind people to help me out? Thanks in advance!

bexxnaz commented 1 month ago

You can download these datasets from the following links:

webvid-10M: TempoFunk/webvid-10M cc12m: GitHub Repository