InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
https://lmdeploy.readthedocs.io/en/latest/
Apache License 2.0
3.13k stars 280 forks source link

[Feature] Please add support for Qwen2 #1805

Closed mihara-bot closed 6 days ago

mihara-bot commented 1 week ago

Motivation

It is convenient to use current SOTA LLMs to inference. Many thanks for your help!

Related resources

Here is the model list: https://huggingface.co/collections/Qwen/qwen2-6659360b33528ced941e557f https://huggingface.co/Qwen/Qwen2-72B https://huggingface.co/Qwen/Qwen2-7B-Instruct https://huggingface.co/Qwen/Qwen2-72B-Instruct

Additional context

No response

zhyncs commented 1 week ago

Currently supported

mihara-bot commented 6 days ago

Thanks for your help! I've tried Qwen2 series models and it definitely works.