open-compass / VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 30+ benchmarks
https://huggingface.co/spaces/opencompass/open_vlm_leaderboard
Apache License 2.0
721 stars 85 forks source link

Image embeddings #245

Closed chandrabhuma closed 3 weeks ago

chandrabhuma commented 3 weeks ago

Is it possible to obtain image embeddings from vision encoders of VLMs in this kit?

junming-yang commented 3 weeks ago

We are focused on VLM's prediction and result evaluation and currently do not support obtaining image embeddings.

chandrabhuma commented 3 weeks ago

Thank you for the prompt response

chandrabhuma commented 3 weeks ago

Thank you for the prompt response