InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
https://lmdeploy.readthedocs.io/en/latest/
Apache License 2.0
4.74k stars 432 forks source link

[Feature] qwen2 vl support the turbomind engine #2774

Open DexterGuo opened 1 week ago

DexterGuo commented 1 week ago

Motivation

1、The qwen2vl effect is the sota level in the open source model 2、lmdeploy is an excellent inference framework 3、So it's important to support turbomind

Related resources

No response

Additional context

No response

lvhan028 commented 1 week ago

2720 is working on it.

ciwang commented 3 days ago

Hi @lvhan028, do you have an estimate of when that PR will be merged? Thank you in advance!