mbzuai-oryx / Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
https://mbzuai-oryx.github.io/Video-ChatGPT
Creative Commons Attribution 4.0 International
1.23k stars, 108 forks

MSVD-QA and MSRVTT-QA evaluation #68

KerolosAtef closed this issue 11 months ago

KerolosAtef commented 1 year ago

For the results in the few-shot table, which file did you use for the MSVD-QA and MSRVTT-QA evaluation: test_qa or val_qa?

hanoonaR commented 11 months ago

Hi @KerolosAtef,

Thank you for your interest in our work, and apologies for the late response. We use the validation set for MSVD and MSRVTT, and the test set for TGIF. A copy of the annotation files has been attached in the links.

Thank you
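
For anyone reproducing the evaluation, the split choice stated in the reply can be written down as a small lookup. This is only a minimal sketch, not code from the Video-ChatGPT repository: the dictionary name `EVAL_SPLIT`, the helper `annotation_filename`, and the `.json` extension are assumptions made for illustration. The thread itself only mentions the names `test_qa` and `val_qa`, and the real TGIF annotation file may be named differently.

```python
# Split used for each benchmark, as stated in the reply in this thread.
EVAL_SPLIT = {
    "MSVD": "val",     # validation set
    "MSRVTT": "val",   # validation set
    "TGIF": "test",    # test set
}


def annotation_filename(dataset: str) -> str:
    """Return a hypothetical annotation file name for the given dataset.

    Assumption: files follow the "<split>_qa.json" pattern. The thread only
    names "test_qa" and "val_qa" without an extension, so check the files
    you actually downloaded before relying on this.
    """
    split = EVAL_SPLIT[dataset]  # raises KeyError for an unknown dataset
    return f"{split}_qa.json"


if __name__ == "__main__":
    for name in EVAL_SPLIT:
        print(name, "->", annotation_filename(name))
```

Keeping the mapping in one place makes it harder to evaluate on `test_qa` by accident when the reported numbers were obtained on `val_qa`.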