OpenGVLab / InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Apache License 2.0
1.3k stars, 85 forks

Is there demo code for video QA and video captioning? #104

Open LanHao0 opened 5 months ago

LanHao0 commented 5 months ago

Hi, thanks for your great work! I'm checking out the newly released InternVideo2 model, and it's interesting! I saw the demo.ipynb files in the multi_modality folder, which can calculate text probabilities. Is there any demo code for video QA and video captioning?

YajieW99 commented 5 months ago

+1

2811668688 commented 2 months ago

Have you solved your problem? I'm also confused about how to do video captioning.

Varun-GP commented 1 month ago

+1

Varun-GP commented 1 month ago

Could you please provide demo code for the video QA task with the InternVideo2 model?