【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
3.02k
stars
220
forks
source link
Some weights of the model checkpoint at "./Video-LLaVA-7B" were not used when initializing LlavaLlamaForCausalLM: #153
Open
ssuncheol opened 6 months ago
When I execute Video-LLaVA-7B to make a text, the following issue occurred. How to solve this problem. Scripts and models are shown below.
Script : https://github.com/PKU-YuanGroup/Video-LLaVA?tab=readme-ov-file#inference-for-video
LanguageWind_Image : https://huggingface.co/LanguageBind/LanguageBind_Image
LanguageBind_Video_merge : https://huggingface.co/LanguageBind/LanguageBind_Video_merge