mbzuai-oryx / Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
https://mbzuai-oryx.github.io/Video-ChatGPT
Creative Commons Attribution 4.0 International
1.17k stars 102 forks source link

Model inference without Gradio #57

Closed tacit0428 closed 11 months ago

tacit0428 commented 11 months ago

Thank you for your interesting work! I want to directly use it by code so that I can process batch of videos instead of using UI page. Could you please provide inference code?

mmaaz60 commented 11 months ago

Hi @tacit0428

Please have a look at #59, thanks