open-compass / VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
https://huggingface.co/spaces/opencompass/open_vlm_leaderboard
Apache License 2.0
1.08k stars 154 forks source link

[Models] add moondream1 and moondream2 models #481

Closed tackhwa closed 4 days ago

tackhwa commented 1 week ago
tackhwa commented 1 week ago

@kennymckormick i am not sure about the variable INSTALL_REQ and INTERLEAVE, may i know what is the meaning behind?

kennymckormick commented 4 days ago

@kennymckormick i am not sure about the variable INSTALL_REQ and INTERLEAVE, may i know what is the meaning behind?

Hi, @tackhwa , Those are just some indicators:

  1. INSTALL_REQ == True means the VLM needs the corresponding github repo installed to run.
  2. Interleave == True means the VLM supports arbitrary interleaved image & text as inputs. If set to False, the VLM may only supports a single image and text string as inputs.
tackhwa commented 4 days ago

@kennymckormick i am not sure about the variable INSTALL_REQ and INTERLEAVE, may i know what is the meaning behind?

Hi, @tackhwa , Those are just some indicators:

  1. INSTALL_REQ == True means the VLM needs the corresponding github repo installed to run.
  2. Interleave == True means the VLM supports arbitrary interleaved image & text as inputs. If set to False, the VLM may only supports a single image and text string as inputs.

@kennymckormick thanks for the reply, the PR is ready now, please take your time to review it, appreciate.