Closed weicheng59 closed 8 months ago
update:
using LMDeploy: 0.0.14+62282fe
change the config.ini's session_len to 4096
and copy model module of yi to model.py with session_len=4096 in arg
self.session_len = session_len
in init
I can get response normally using 1987 Chinese charactor request
@weicheng59 we are rushing for lmdeploy v0.2.0, which is planned to be released on 1.15 @AllentDan will get back to this issue after the rush
Your reply is appreciated. I will just use 0.0.14 version for now. Will help out test this issue in v0.2.0 after your release and good luck on the rush.
this issue is not present on the latest build v0.2.0.
Checklist
Describe the bug
Model:
https://huggingface.co/01-ai/Yi-34B-Chat https://huggingface.co/NousResearch/Nous-Capybara-34B
Api usage:
url: http://192.168.40.41:8888/v1/chat/interactive
body: 1986 Chinese charactor request
response is normal:
body: 1987 Chinese charactor request
response is empty:
At first I thought its the problem of Yi-34B-Chat only support 4k context so I tried Nous-Capybara-34B which should have 200k context but the problem is the same. I recalled from a previous version that i dont remember eactly which, setting the --session_len 10000 will solve this problem but in the lastest version it does not seemed to do anything.
Let me know if you need any more detail.
Reproduction
CMD to serve:
lmdeploy serve api_server /path/to/yi --model-name yi --session_len 10000 --server_name 0.0.0.0 --server_port 8886 --instance_num 128 --tp 1
Environment
Error traceback
No response