[X] 1. I have searched related issues but cannot get the expected help.
[X] 2. The bug has not been fixed in the latest version.
[X] 3. Please note that if the bug-related issue you submitted lacks corresponding environment info and a minimal reproducible demo, it will be challenging for us to reproduce and resolve the issue, reducing the likelihood of receiving feedback.
Checklist
Describe the bug
怀疑可能是因为我使用了llm-compressor的量化脚本?https://github.com/vllm-project/llm-compressor
Reproduction
lmdeploy serve api_server /mnt/e/Code/models/Orca-2-13b-W8A8 --server-port 8000 --tp 2 --dtype float16 --backend pytorch
Environment
Error traceback