mbzuai-oryx / Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
https://mbzuai-oryx.github.io/Video-ChatGPT
Creative Commons Attribution 4.0 International
1.05k stars 92 forks source link

Adding single video inference code without gradio web app. #59

Closed ashmalvayani closed 8 months ago

ashmalvayani commented 8 months ago

With this piece of code, you'll be able to run this file to run your inference on a sample video and get the results of your VQA model on the same terminal. The instructions on how to run the code are in the top comment of the single_video_inference.py code. Please replace the paths with the actual paths.

mmaaz60 commented 8 months ago

Thanks @ashmalvayani

solves #53 #57

ds-gong commented 3 months ago

Hi, excuse me. I encountered this error when running single_video_inference.py. What could be the reason for this? Could you help me take a look? 20240316-214607