open-compass / VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks
https://huggingface.co/spaces/opencompass/open_vlm_leaderboard
Apache License 2.0
1.34k stars 188 forks source link

[Model] Phi3.5 Added #396

Closed amitbcp closed 2 months ago

amitbcp commented 2 months ago

Link : https://huggingface.co/microsoft/Phi-3.5-vision-instruct

The model provides uses for general purpose AI systems and applications with visual and text input capabilities which require:

  1. Memory/compute constrained environments
  2. Latency bound scenarios
  3. General image understanding
  4. Optical character recognition
  5. Chart and table understanding
  6. Multiple image comparison
  7. Multi-image or video clip summarization