NVlabs / VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Apache License 2.0
973 stars 68 forks source link

Access to pretrained model weights #68

Open zzxslp opened 1 month ago

zzxslp commented 1 month ago

Hi, thanks for the great work!

May I know if we have access to non-instruct models? (e.g., models after stage-1 or stage-2). For my specific research use case, I would love to access the model weights after mm_align + pretrain (before SFT).

yaolug commented 1 month ago

We will release stage-2 model checkpoints soon.

zzxslp commented 1 month ago

Thanks, looking forward to it.

UelisonSantos commented 1 week ago

Hello, When will the model checkpoint be released? I am working on a paper and would love to include VILA in the tests. This would be a tremendous help for me!