PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
https://arxiv.org/pdf/2311.10122.pdf
Apache License 2.0
3.04k stars, 220 forks
issues
I can't try out the Gradio Web UI in my local environment
#203
mKatsumata
opened
3 days ago
0
How to inference the model with lora?
#202
sterzhang
opened
6 days ago
0
Can you Fix the DEMO. Demo is no longer working
#201
thisurawz1
opened
6 days ago
0
QLoRA support for v1.5
#200
Gokulakrishnan-DL-CV
opened
1 week ago
0
Error when running the inference script
#199
crystal-ltf
opened
1 week ago
0
RuntimeError: The size of tensor a (2074) must match the size of tensor b (19) at non-singleton dimension 3
#198
zky-kf
closed
2 weeks ago
0
Can not reproduce the results on MSVD-QA and TGIF-QA
#197
Jingchensun
opened
3 weeks ago
0
Runtime error in video-llava huggingface spaces.
#196
helpmeIamnewbie
opened
3 weeks ago
0
Help with evaluation script for lora finetuned model.
#195
marvlyngkhoi
opened
1 month ago
0
Hardware Requirement for the model to run in LORA
#194
leochang123
opened
1 month ago
0
Videochatgpt tuning data encounters some error
#193
Lexarymade
opened
1 month ago
1
Can you Fix the DEMO. Demo is no longer working
#192
thisurawz1
opened
1 month ago
0
Can't reproduce results on MSRVTT and MSVD dataset
#191
1999Lyd
closed
2 months ago
3
Issues with Converting the video-llava Model to ONNX
#190
Ark1a
closed
3 months ago
0
Pretrain and Finetune template versions
#189
xin-li-67
opened
3 months ago
1
When I evaluated the ‘TGIF_Zero_Shot_QA’ dataset, the accuracy was only 13%. Should I train first to achieve the 70% accuracy in the paper?
#188
FanshuoZeng
opened
3 months ago
0
May I ask what is the api_base used for evaluation?
#187
FanshuoZeng
opened
3 months ago
0
missing file: preprocessor_config.json
#186
JunanPan
closed
4 months ago
1
Can this model apply a few-shot when inference?
#185
Ijustakid
opened
4 months ago
0
ImportError: cannot import name '_expand_mask' from 'transformers.models.clip.modeling_clip'
#184
qiuchen001
opened
4 months ago
4
Update llava_arch.py for device synchronization of mm_projector outputs
#183
FangXinyu-0913
opened
4 months ago
0
Valley video not found during pretraining.
#182
Aakriti05
opened
4 months ago
0
Api is not running properly getting errors In each endpoint
#181
RAJA102002
opened
4 months ago
0
Questions about LanguageBind Usage
#180
lingjunzhao
opened
4 months ago
0
Multi-GPU inference problem.
#179
jiazheng-xing
opened
4 months ago
1
Request for Inference Parameters on VideoLLava
#178
adrianwestmoon
opened
4 months ago
0
Is it possible to train with languages other than English, and are the 8 frames sampled uniformly across different video lengths?
#177
YoungjaeDev
opened
4 months ago
0
size mismatch
#176
cs19469
opened
4 months ago
0
error:RuntimeError: Error(s) in loading state_dict for CLIPVisionModel: size mismatch for vision_model.embeddings.class_embedding: copying a param with shape torch.Size([1024]) from checkpoint, the shape in current model is torch.Size([768]).
#175
zapqqqwe
opened
5 months ago
3
total_frames zero error
#174
OliverLeeXZ
opened
5 months ago
0
pretrained checkpoint
#173
OliverLeeXZ
opened
5 months ago
0
How to increase the sample frames amount
#172
sherlock666
opened
5 months ago
0
Issues with finetune_lora.sh
#171
shag1802
opened
5 months ago
2
Error with Gradio Client: Please Upgrade Gradio to 4.x and Redeploying HuggingFace Space
#170
zhanwenchen
opened
5 months ago
0
Question Regarding Video Frame Processing
#169
Kkkaystone
opened
5 months ago
0
How to install videollava together with xformer?
#168
zengbohan0217
opened
5 months ago
0
extremely slow with transformers
#167
RaulKite
opened
5 months ago
0
Training help
#166
felmoreno1726
opened
6 months ago
0
About class embedding
#165
feiyu12138
opened
6 months ago
0
Video-LLava Upgradation
#164
Tortoise17
opened
6 months ago
1
ERROR opening+moov atom not found+mmco: unref short failure
#163
Frank-Dg
opened
6 months ago
2
Can the confidence coefficient of an answer be obtained?
#162
IsabelJimenez99
closed
6 months ago
0
Inference model path unclear
#161
Ali2500
opened
6 months ago
0
Fix typo and additional instruction in readme
#160
nahidalam
closed
6 months ago
0
Please specify library versions
#159
nahidalam
opened
6 months ago
0
Update README.md
#158
OliverGrace
opened
6 months ago
0
Uri validation issue on Replicate
#157
Gab1988
closed
6 months ago
1
Video-LLaVa now available in the Transformers library!
#156
zucchini-nlp
opened
6 months ago
56
chore: Update processing_video.py
#155
githubartema
opened
6 months ago
0
The problem about the environment
#154
swiftCC
closed
6 months ago
0