PKU-YuanGroup Video-LLaVA issues

PKU-YuanGroup / Video-LLaVA

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

https://arxiv.org/pdf/2311.10122.pdf

Apache License 2.0

3.02k stars 220 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

How to inference the model with lora?

#202 sterzhang opened 3 hours ago
0
Can you Fix the DEMO. Demo is no longer working

#201 thisurawz1 opened 15 hours ago
0
QLoRA support for v1.5

#200 Gokulakrishnan-DL-CV opened 5 days ago
0
执行推理脚本报错

#199 crystal-ltf opened 6 days ago
0
RuntimeError: The size of tensor a (2074) must match the size of tensor b (19) at non-singleton dimension 3

#198 zky-kf closed 1 week ago
0
Can not reproduce the results on MSVD-QA and TGIF-QA

#197 Jingchensun opened 2 weeks ago
0
Runtime error in video-llava huggingface spaces.

#196 helpmeIamnewbie opened 3 weeks ago
0
Help with evaluation script for lora finetuned model.

#195 marvlyngkhoi opened 4 weeks ago
0
Hardware Requirement for the model to run in LORA

#194 leochang123 opened 1 month ago
0
Videochatgpt tuning data encounters some error

#193 Lexarymade opened 1 month ago
1
Can you Fix the DEMO. Demo is no longer working

#192 thisurawz1 opened 1 month ago
0
Can't reproduce results on MSRVTT and MSVD dataset

#191 1999Lyd closed 2 months ago
3
Issues with Converting the video-llava Model to ONNX

#190 Ark1a closed 3 months ago
0
Pretrain and Finetune template versions

#189 xin-li-67 opened 3 months ago
1
When I evaluated the ‘TGIF_Zero_Shot_QA’ dataset, the accuracy was only 13%. Should I train first to achieve the 70% accuracy in the paper?

#188 FanshuoZeng opened 3 months ago
0
May I ask what is the api_base used for evaluation?

#187 FanshuoZeng opened 3 months ago
0
missing file: preprocessor_config.json

#186 JunanPan closed 4 months ago
1
Can this model apply a few-shot when inference?

#185 Ijustakid opened 4 months ago
0
ImportError: cannot import name '_expand_mask' from 'transformers.models.clip.modeling_clip'

#184 qiuchen001 opened 4 months ago
4
Update llava_arch.py for device synchronization of mm_projector outputs

#183 FangXinyu-0913 opened 4 months ago
0
Valley video not found during pretraining.

#182 Aakriti05 opened 4 months ago
0
Api is not running properly getting errors In each endpoint

#181 RAJA102002 opened 4 months ago
0
Questions about LanguageBind Usage

#180 lingjunzhao opened 4 months ago
0
Multi-GPU inference problem.

#179 jiazheng-xing opened 4 months ago
1
Request for Inference Parameters on VideoLLava

#178 adrianwestmoon opened 4 months ago
0
Is it possible to train with languages other than English, and are the 8 frames sampled uniformly across different video lengths?

#177 YoungjaeDev opened 4 months ago
0
size mismatch

#176 cs19469 opened 4 months ago
0
error:RuntimeError: Error(s) in loading state_dict for CLIPVisionModel: size mismatch for vision_model.embeddings.class_embedding: copying a param with shape torch.Size([1024]) from checkpoint, the shape in current model is torch.Size([768]).

#175 zapqqqwe opened 4 months ago
3
total_frames zero error

#174 OliverLeeXZ opened 4 months ago
0
pretrained checkpoint

#173 OliverLeeXZ opened 5 months ago
0
How to increase the sample frames amount

#172 sherlock666 opened 5 months ago
0
Issues with finetune_lora.sh

#171 shag1802 opened 5 months ago
2
Error with Gradio Client: Please Upgrade Gradio to 4.x and Redeploying HuggingFace Space

#170 zhanwenchen opened 5 months ago
0
Question Regarding Video Frame Processing

#169 Kkkaystone opened 5 months ago
0
How to install videollava together with xformer?

#168 zengbohan0217 opened 5 months ago
0
extremely slow with transformers

#167 RaulKite opened 5 months ago
0
Training help

#166 felmoreno1726 opened 5 months ago
0
About class embedding

#165 feiyu12138 opened 5 months ago
0
Video-LLava Upgradation

#164 Tortoise17 opened 6 months ago
1
ERROR opening+moov atom not found+mmco: unref short failure

#163 Frank-Dg opened 6 months ago
2
Can the confidence coefficient of an answer be obtained?

#162 IsabelJimenez99 closed 6 months ago
0
Inference model path unclear

#161 Ali2500 opened 6 months ago
0
Fix typo and additional instruction in readme

#160 nahidalam closed 6 months ago
0
Please specifiy library versions

#159 nahidalam opened 6 months ago
0
Update README.md

#158 OliverGrace opened 6 months ago
0
Uri validation issue on Replicate

#157 Gab1988 closed 6 months ago
1
Video-LLaVa now available in the Transformers library!

#156 zucchini-nlp opened 6 months ago
56
chore: Update processing_video.py

#155 githubartema opened 6 months ago
0
The problem about the environment

#154 swiftCC closed 6 months ago
0
Some weights of the model checkpoint at "./Video-LLaVA-7B" were not used when initializing LlavaLlamaForCausalLM:

#153 ssuncheol opened 6 months ago
0