NVlabs / VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Apache License 2.0
1.85k stars 149 forks source link

Fix vision engine build #64

Closed meenchen closed 4 months ago