Closed Jeremy-J-J closed 2 months ago
Please wait for the next release of LMDeploy. Or, you may build lmdeploy from source. The model was supported in https://github.com/InternLM/lmdeploy/pull/2207 lately.
Please wait for the next release of LMDeploy. Or, you may build lmdeploy from source. The model was supported in #2207 lately.请等待 LMDeploy 的下一个版本。或者,您可以从源代码构建 lmdeploy。该模型 #2207 最近得到了支持。
tks, I will try
v0.5.3 is released. May give it a try.
This issue is marked as stale because it has been marked as invalid or awaiting response for 7 days without any further response. It will be closed in 5 days if the stale label is not removed or if there is no further response.
This issue is closed because it has been stale for 5 days. Please open a new issue if you have similar issues or you have any new updates now.
您好,我在使用0.5.3的lmdeploy对internvl2-4B进行awq量化的时候,最后保存模型的遇到了下面的问题,我想请问一下是怎么回事呢,麻烦大佬了。
我在对 IntrenVL2-1B awq量化的时候遇到了相同的问题,请问是怎么回事呢?
@AllentDan
Downgrade transformers version please.
Checklist
Describe the bug
InternVL2-1B awq 4bit量化后推理异常,推理耗时异常(A6000上不量化推理耗时1.2s,量化后推理耗时18s)且输出异常,输出一堆 """"""""
Reproduction
量化脚本
将量化完成后checkpoint-merged.int4下文件拷贝至internvl-1B-4bit
量化后推理脚本
Environment