OpenGVLab / Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
https://vchat.opengvlab.com/
MIT License
2.85k stars 230 forks

VideoChat2 Raw Training video download #176

Open mingzeG opened 1 month ago

mingzeG commented 1 month ago

Great work! I was hoping to quickly get the raw video dataset and try training VideoChat2. How can I get a filtered raw video dataset, rather than downloading all of the video datasets mentioned in data.md? Downloading everything takes up too much space.

yinanhe commented 1 month ago

Most of the videos are common and relatively easy to obtain. For the datasets that are more difficult to access: EgoQA videos can be downloaded from this link, VideoChat2 conversation videos from link, and Youcook from link.

pengzhiliang commented 1 month ago

Hello @yinanhe, how can I download the CLEVRER dataset? The link shows nothing.

yinanhe commented 1 month ago

@pengzhiliang
You can download from the links below.
Training: Videos, Annotations, Questions and Answers
Validation: Videos, Annotations, Questions and Answers
Testing: Videos, Questions
Object Masks and Attributes
Readme

pengzhiliang commented 1 month ago

@yinanhe Thanks very much 🍻

schopra8 commented 1 month ago

@yinanhe Do you have alternative links? The video links aren't working for me.

yinanhe commented 1 month ago

@schopra8 We no longer have any other links. If you are still having difficulties in getting the data, please let me know which dataset it is.

schopra8 commented 1 month ago

Apologies for the lack of specificity! I mean the CLEVRER dataset specifically. The question+answers and README load for me -- but the videos don't, when I click on those links.

I reached out to the original authors of the CLEVRER dataset as well, but they haven't responded.

schopra8 commented 1 month ago

@yinanhe -- I'm also struggling to find the VideoChat videos used in the conversation and caption annotations. Do you have any pointers to where I can download this data? Thank you in advance!

yinanhe commented 1 month ago

> Apologies for the lack of specificity! I mean the CLEVRER dataset specifically. The question+answers and README load for me -- but the videos don't, when I click on those links.
>
> I reached out to the original authors of the CLEVRER dataset as well, but they haven't responded.

@schopra8 How about using the links in https://github.com/OpenGVLab/Ask-Anything/issues/176#issuecomment-2121805009? The downloads work normally in my network environment.

yinanhe commented 1 month ago

> @yinanhe -- I'm also struggling to find the VideoChat videos used in the conversation and caption annotations. Do you have any pointers to where I can download this data? Thank you in advance!

The videos used here are from YouTube, and some of them might be found in the InternVid dataset on opendatalab.com.

schopra8 commented 1 month ago

Thank you @yinanhe!

schopra8 commented 1 month ago

I now see that the VideoChat data names correspond to WebVid like VideoChat2 -- and can resolve the videos.

@yinanhe - Thanks for all the help! It might be helpful to update the Data.md file to clarify that VideoChat corresponds to videos from WebVid. The link to InternVid threw me off -- and had me incorrectly looking at the InternVid-10M dataset.