Open austingg opened 4 days ago
Can you run this command lmdeploy serve gradio internlm/internlm2-chat-7b-4bits --model-format awq --tp 2
successfully?
lmdeploy serve gradio /models/Mini-InternVL-Chat-2B-V1-5-AWQ --model-format awq --server-port 10000
is ok.
I need download internlm/internlm2-chat-7b-4bits
models, wait for a while
You can try lmdeploy serve gradio /models/Mini-InternVL-Chat-2B-V1-5-AWQ --model-format awq --server-port 10000 --tp 2
I think it might related to nccl.
lmdeploy serve gradio /models/Mini-InternVL-Chat-2B-V1-5-AWQ --model-format awq --server-port 10000 --tp 2
is ok too.
Besides, lmdeploy chat /models/InternVL-Chat-V1-5-AWQ/ --model-format awq --tp 2
is ok.
Only serve api_server/gradio
crash
Checklist
Describe the bug
Segmentation fault when I deploy the InternVL-Chat-V1-5-AWQ on T4 using
openmmlab/lmdeploy:latest
imageReproduction
lmdeploy serve gradio /models/InternVL-Chat-V1-5-AWQ/ --model-format awq --tp 2
Environment
Error traceback