Closed dcahn12 closed 12 months ago
@dcahn12 how about other video benchmarks ?
Refer to this issue.
@xmy0916 I only tested MSVD Video-QA.
Thanks for your contribution!
I tried to reproduce your result (Zero-shot VideoQA on MSVD dataset) with the pretrained weight https://huggingface.co/LanguageBind/Video-LLaVA-7B/tree/main.
But the result is completely different from your paper. (Reproduced result is shown below)
Can you check this?
My test results on MSVD:
Yes count: 4041
No count: 9116
Accuracy: 0.30713688530820094
Average score: 2.726077373261382
Thanks for your contribution!
I tried to reproduce your result (Zero-shot VideoQA on MSVD dataset) with the pretrained weight https://huggingface.co/LanguageBind/Video-LLaVA-7B/tree/main.
But the result is completely different from your paper. (Reproduced result is shown below)
Can you check this?