[Discussion] We are contributing 🎉🎉🎉Video-LLaVA

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

https://llava.hliu.cc

Apache License 2.0

20.18k stars 2.23k forks source link

Open LinB203 opened 11 months ago

LinB203 commented 11 months ago

Hello, esteemed LLaVA developer, thank you for contributing such robust code and data to the community.

We have extended LLaVA to Video-LLaVA to achieve advanced performance on MSRVTT,MSVD,TGIF,ACTIVITYNET.

Thank you again for your contributions to the large visual-language model!

LinB203 commented 11 months ago