Hi, thanks for your great work!
I'm checking at the new released model internVideo2, it's interesting!
I saw demo.ipynb files in multi_modality folder, it can calculate text prob.
I'm wondering if there's any demo code for video QA and video Captioning?
Hi, thanks for your great work! I'm checking at the new released model internVideo2, it's interesting! I saw demo.ipynb files in multi_modality folder, it can calculate text prob. I'm wondering if there's any demo code for video QA and video Captioning?