Closed summerrain321 closed 5 months ago
I could not reproduce this in local testing. You could try the Docker image:
docker pull openmmlab/lmdeploy:v0.4.2
This issue is marked as stale because it has been marked as invalid or awaiting response for 7 days without any further response. It will be closed in 5 days if the stale label is not removed or if there is no further response.
This issue is closed because it has been stale for 5 days. Please open a new issue if you have similar issues or you have any new updates now.
Checklist
Describe the bug
When serving the 110B model, the following command hangs for a long time without returning a result:
curl http://localhost:PORT/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "qwen",
    "messages": [
      {"role": "system", "content": "You are a helpful assistant."},
      {"role": "user", "content": "Who won the world series in 2020?"}
    ]
  }'
When serving the 72B model, the same command returns a result quickly.
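To tell a request that is slow from one that never returns, it may help to send the same request with an explicit timeout. The following is a minimal sketch using only the Python standard library. The helper names `build_chat_request` and `send_with_timeout`, the port `8000`, and the 60-second timeout are illustrative assumptions, not part of lmdeploy.

```python
import json
import urllib.request


def build_chat_request(port, model="qwen"):
    """Build the same POST request as the curl command in this report."""
    payload = {
        "model": model,
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": "Who won the world series in 2020?"},
        ],
    }
    return urllib.request.Request(
        f"http://localhost:{port}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send_with_timeout(port, timeout_s=60.0):
    """Return the parsed JSON reply, or None if the request fails or times out.

    Note: the timeout applies to each blocking socket operation
    (connect, read), not to the total duration of the request.
    """
    req = build_chat_request(port)
    try:
        with urllib.request.urlopen(req, timeout=timeout_s) as resp:
            return json.loads(resp.read().decode("utf-8"))
    except OSError as exc:
        # URLError, HTTPError and socket timeouts are all OSError subclasses.
        print(f"no response: {exc!r}")
        return None


if __name__ == "__main__":
    # 8000 is a placeholder; use the port passed to --server-port.
    print(send_with_timeout(8000, timeout_s=60.0))
```

Running this once against the 72B server and once against the 110B server with the same timeout would show whether the 110B server eventually answers or does not respond at all.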
Reproduction
lmdeploy serve api_server PATH/Qwen1.5-110B-Chat --server-port PORT (any port) --model-name qwen --tp 8 (or 4)
Environment
Error traceback
No response