haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
https://llava.hliu.cc
Apache License 2.0
20.18k stars 2.23k forks

[Discussion] We are contributing 🎉🎉🎉Video-LLaVA #825

Open LinB203 opened 11 months ago

LinB203 commented 11 months ago

Discussion

Hello, esteemed LLaVA developer, thank you for contributing such robust code and data to the community.

We have extended LLaVA to Video-LLaVA, which achieves advanced performance on MSRVTT, MSVD, TGIF, and ActivityNet.

Thank you again for your contributions to large vision-language models!

LinB203 commented 11 months ago
sota