mbzuai-oryx / Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
https://mbzuai-oryx.github.io/Video-ChatGPT
Creative Commons Attribution 4.0 International
1.05k stars 93 forks source link

What training-dataset do you use for evaluating zero-shot ActivityNet-QA? #15

Closed dcahn12 closed 1 year ago

dcahn12 commented 1 year ago

Hi!

As I read the paper, I couldn't find any training dataset except ActivityNet 100K pairs for instruction tuning. What training-dataset do you use for evaluating zero-shot ActivityNet-QA?