Efficient-Large-Model / VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Apache License 2.0
878 stars 55 forks source link

OpenVLM leaderboard #74

Open oroojlooy opened 1 week ago

oroojlooy commented 1 week ago

Do you have any plan to get involved in OpenVLM leaderboard? https://huggingface.co/spaces/opencompass/open_vlm_leaderboard I think that needs some efforts from your side, but given the performance of VILA provides you good visibility.