NVlabs / VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Apache License 2.0
973 stars 68 forks source link

OpenVLM leaderboard #74

Open oroojlooy opened 2 weeks ago

oroojlooy commented 2 weeks ago

Do you have any plan to get involved in OpenVLM leaderboard? https://huggingface.co/spaces/opencompass/open_vlm_leaderboard I think that needs some efforts from your side, but given the performance of VILA provides you good visibility.

Lyken17 commented 6 days ago

@oroojlooy sure, we would love to. Our models and inference examples are already released. Any other efforts we need to get involved in the benchmark?