Hey @Weiyun1025! Thank you for opening this issue. I took a brief look at the model repo https://huggingface.co/OpenGVLab/InternVL2-40B/tree/main, and it seems to me that supporting this model should be fairly straightforward (similar to what we did with Phi-3-vision).
Are you planning to make a pull request for this? If so, feel free to take a look at the other vision language model implementations in vLLM, and let us know if you run into any issues. We're happy to help you get this model supported.
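In the meantime, here is a minimal sketch of how an out-of-tree implementation could be wired into vLLM while a proper PR is in progress. The module path `my_internvl2` and the class `InternVLChatModel` are assumptions you would fill in yourself by following the existing vision-language models under `vllm/model_executor/models/`; the architecture name passed to the registry should match the `architectures` field in the model's `config.json`.

```python
# Sketch: registering a locally written InternVL2 implementation with vLLM.
from vllm import ModelRegistry

# Hypothetical local implementation, modeled on vLLM's existing VLM classes.
from my_internvl2 import InternVLChatModel

# Map the architecture name declared in the HF config.json to the vLLM class,
# so vLLM can instantiate it when loading OpenGVLab/InternVL2-* checkpoints.
ModelRegistry.register_model("InternVLChatModel", InternVLChatModel)
```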
If you cannot make a pull request, I will try to see if I have some bandwidth to make a PR on this. Feel free to check out #4194 for the full roadmap around multi-modality.
Thanks!
🚀 The feature, motivation and pitch
InternVL2 is currently the most powerful open-source Multimodal Large Language Model (MLLM). The InternVL2 family includes models ranging from a 2B model, suitable for edge devices, to a 108B model, which is significantly more powerful. Built on larger-scale language models, InternVL2-Pro demonstrates outstanding multimodal understanding, matching the performance of commercial closed-source models across various benchmarks.
Given the significant potential of InternVL2, we believe that integrating it with vLLM would greatly benefit both the vLLM community and users of this model. We kindly request your assistance in enabling the deployment of InternVL2 using the vLLM framework.
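To make the request concrete, below is a rough, hypothetical sketch of what offline inference with InternVL2 through vLLM could look like once support lands. The prompt template and the `<image>` placeholder token are assumptions; the final interface will depend on how the model is actually integrated.

```python
# Hypothetical usage once InternVL2 is supported in vLLM; treat this only as
# the rough shape of a multimodal generate() call, not the final API contract.
from PIL import Image
from vllm import LLM, SamplingParams

llm = LLM(model="OpenGVLab/InternVL2-40B", trust_remote_code=True)

image = Image.open("example.jpg")
outputs = llm.generate(
    {
        "prompt": "<image>\nDescribe this image.",  # placeholder token is an assumption
        "multi_modal_data": {"image": image},
    },
    SamplingParams(max_tokens=128),
)
print(outputs[0].outputs[0].text)
```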
We look forward to your positive response and are eager to collaborate on this exciting endeavor.
Alternatives
No response
Additional context
Blog: https://internvl.github.io/blog/2024-07-02-InternVL-2.0/
Model Family: https://huggingface.co/collections/OpenGVLab/internvl-20-667d3961ab5eb12c7ed1463e