PKU-YuanGroup / Video-LLaVA

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
https://arxiv.org/pdf/2311.10122.pdf
Apache License 2.0
2.88k stars 207 forks source link

pretrained checkpoint #173

Open OliverLeeXZ opened 3 months ago

OliverLeeXZ commented 3 months ago

Great job! Could you please release the pretrained checkpoint of video-llava so that we can use it in fine-tuning stage? Thanks!