VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
2.04k
stars
164
forks
source link
Issue: The size of tensor a (2) must match the size of tensor b (8) at non-singleton dimension 0 #151
Open
apfsds3bm9 opened 1 week ago