issues
search
Efficient-Large-Model
/
VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Apache License 2.0
865
stars
55
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Multi image inference quality
#79
oroojlooy
opened
1 day ago
0
The inference video reports an error: ValueError: Unable to create tensor, you should probably activate padding with 'padding=True' to have batched tensors with the same length.
#78
changqinyao
opened
2 days ago
2
Question about the output
#77
DwanZhang-AI
opened
5 days ago
2
What is the --conv-mode of VILA1.5-13b?
#76
DwanZhang-AI
opened
5 days ago
2
added functionality to process a bunch of videos at a time
#75
poorfrombabylon
closed
1 week ago
0
OpenVLM leaderboard
#74
oroojlooy
opened
1 week ago
0
VILA Context-length
#73
oroojlooy
closed
1 week ago
2
Why setting LLaMa3's padding direction to "right"?
#72
ROIM1998
opened
3 weeks ago
1
Bug in conversation.py
#71
zhang-jr
opened
1 month ago
0
Finetuning
#70
RohanR04
closed
3 weeks ago
0
About VILADistributedSampler and gradient_accumulation_steps
#69
dreamerlin
opened
1 month ago
1
Access to pretrained model weights
#68
zzxslp
opened
1 month ago
3
VILA-1.5 details
#67
Lopa07
closed
1 month ago
4
How does the VILA preprocessed video?
#66
MonolithFoundation
opened
1 month ago
1
Does S2 able to unfreeze vit to train?
#65
MonolithFoundation
closed
1 week ago
1
Fix vision engine build
#64
meenchen
closed
1 month ago
0
What is the LLM used for VILA 1.5 40B?
#63
javier-m
closed
1 month ago
1
math dataset incomplete description
#62
hubenjm
opened
1 month ago
2
YouCook2 code to generate video clips from raw videos?
#61
hubenjm
opened
1 month ago
4
RuntimeError: GET was unable to find an engine to execute this computation
#60
pribadihcr
opened
1 month ago
1
No module named 'llava.tf_utils'
#59
pribadihcr
closed
1 week ago
5
Would you consider releasing code that supports lora training 40b model?
#58
Key-lei
opened
1 month ago
1
When will new annotations files be available?
#57
hubenjm
closed
1 month ago
1
"No module named llava"
#56
vedantroy
closed
1 month ago
1
How's the DownSampleBlock performance compare with CAbstractor?
#55
lucasjinreal
opened
1 month ago
3
Potential bug in mm_utils.py process_image function
#54
hubenjm
opened
1 month ago
1
working with VLLM
#53
kousun12
opened
1 month ago
1
How to evaluate 4shot?
#52
leexinhao
opened
1 month ago
0
Running the AWQ models
#51
signine
opened
1 month ago
3
Provide ShareGPT4V filtered annotations file
#50
hubenjm
opened
1 month ago
0
About perception testset
#49
mary-0830
opened
1 month ago
3
Inference not working - Keyword tensor should have 2 or 3 dimensions, got 1
#48
signine
opened
1 month ago
5
demo_trt_llm/convert_checkpoint.py - AttributeError: 'LlavaLlamaConfig' object has no attribute 'num_attention_heads'
#47
dimakan
closed
1 month ago
3
Hi, Have you compare with s2 [384, 768] scales versus interpolate to 768x768?
#46
OpenJarvisAI
opened
1 month ago
6
Add support for GPUs with compute capability lower than 8.0 for awq/kernels installation
#45
rahulthakur319
opened
1 month ago
1
Fix for backwards compatibility
#44
michael-heinrich
opened
1 month ago
0
fix: PR #40 other bug.
#43
SeanCraven314
closed
1 month ago
4
Request for middle checkpoint
#42
jihaonew
opened
1 month ago
3
Easy backwards compatibility fix
#41
michael-heinrich
opened
1 month ago
3
fix: Fix tensor shape error, during llava inference.
#40
SeanCraven314
closed
1 month ago
1
Llama-3-VILA1.5-8B Inference error
#39
joebradly
opened
1 month ago
11
Updated paper on the latest model (video understanding, etc.)
#38
thecooltechguy
opened
1 month ago
4
Chamfer distance's data source
#37
threegold116
closed
1 month ago
2
Instruction for VILA 1.5 with tinychat (llm-awq) doesn't work well due to fixed torch version (==2.0.1)
#36
gigony
opened
1 month ago
5
Update readme of VILA1.5
#35
kentang-mit
closed
1 month ago
0
vila1.5 release
#34
Efficient-Large-Language-Model
closed
1 month ago
0
vila1.5 release
#33
Efficient-Large-Language-Model
closed
1 month ago
0
video
#32
Efficient-Large-Language-Model
closed
1 month ago
0
Possibility to support LLama-3?
#31
hzhang57
closed
1 month ago
1
LLM version
#30
gordonhu608
closed
2 months ago
0
Next