issues
search
bfshi
/
scaling_on_scales
When do we not need larger vision models?
MIT License
257
stars
7
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
About Vstar Evaluation
#14
jungle-gym-ac
opened
1 week ago
1
How to infer Llava with S^2?
#13
Gumpest
opened
2 weeks ago
3
How to train Llava with S^2?
#12
Chloe1997
opened
1 month ago
1
Question about the VIT baseline for imagenet classification
#11
mingkai-zheng
closed
1 month ago
4
About the choices in LLaVA+S^2 implementation
#10
jungle-gym-ac
opened
1 month ago
2
Hello, nice work, just wonder if used s2 in llava, will the vit can be trained?
#9
MonolithFoundation
opened
1 month ago
3
Why the res is 1008 in paper?
#8
OpenJarvisAI
opened
2 months ago
1
How to add multiscale_forward functionality to my model
#7
1chenchen22
opened
3 months ago
15
About non-square input
#6
githubiubiu
opened
3 months ago
4
Questions about paper
#5
JihwanEom
opened
3 months ago
1
Num of tokens in LLaVA
#4
RussRobin
closed
3 months ago
3
Any plan to release pre-trained weights?
#3
JihwanEom
opened
3 months ago
2
More Bench?
#2
BlueBlueFF
closed
3 months ago
1
Can you use this with SAM?
#1
jetjodh
closed
3 months ago
1