Repeated answers with Qwen2-1.5B-Instruct #540

bestzy6 commented 2 months ago

I ran into repeated answers while testing Qwen2-1.5B-Instruct. Is this a bug? The model was downloaded from ModelScope, and I did not change any configuration parameters.


{ "model": "Qwen2-1.5B-Instruct", "messages": [ { "role": "system", "content": "Please write a passage to answer the question. Try to include as many key details as possible. Write by Chinese!" }, { "role": "user", "content": "context: XXX \n\n Passage:" } ], "stream": false, "max_tokens": 1024, "temperature": 0.4 }



jklj077 commented 2 months ago


I am unable to reproduce this with the given input. Could you share a full input for us to reproduce the issue?

bestzy6 commented 2 months ago


The full input is "documents related to the BYD electric vehicle project". The issue only occurs intermittently.

jklj077 commented 2 months ago


I have tried to reproduce this with transformers over 20 times with the given input and no luck for me. So I don't think it is a model issue.

What frameworks were you using?
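
For an intermittent issue like this, it can help to flag repetitive outputs automatically across many runs instead of reading each one. The following is only a minimal sketch: the helper names, the n-gram length, and the threshold are illustrative assumptions, not part of any Qwen or vLLM tooling. It counts character n-grams so that it also works on Chinese text, which has no whitespace between words.

```python
from collections import Counter


def max_ngram_repeats(text: str, n: int = 8) -> int:
    """Return how often the most frequent character n-gram occurs in text."""
    if len(text) < n:
        return 0
    counts = Counter(text[i:i + n] for i in range(len(text) - n + 1))
    return max(counts.values())


def looks_repetitive(text: str, n: int = 8, threshold: int = 3) -> bool:
    """Flag an output whose most common n-gram appears at least `threshold` times."""
    return max_ngram_repeats(text, n) >= threshold
```

Running the same request many times and counting how many responses satisfy `looks_repetitive` gives a rough failure rate that can be compared across frameworks and sampling settings.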

bestzy6 commented 2 months ago

@jklj077 I used vLLM for inference; the GPU is a T4.

mide123 commented 2 months ago


Kk1984up commented 2 months ago

I deployed qwen2-72b-instruct with vLLM and also frequently see repeated output. Is this a problem with the model itself, or with the vLLM deployment? @jklj077

JustinLin610 commented 2 months ago

Have you guys tried the demo on HF and ModelScope? You can compare that with your own deployment. And as hyperparameters matter, I don't think changing the temperature to 0.4 is a good choice. Please try with the original hyperparameters for generation.
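
One way to try the original hyperparameters is to read them from the downloaded model directory rather than retyping them. This is a minimal sketch, assuming the directory contains a `generation_config.json` in the usual Hugging Face layout; the helper name and the list of keys are illustrative, and the values depend on the specific model.

```python
import json
from pathlib import Path

# Sampling-related keys commonly found in a Hugging Face generation_config.json.
SAMPLING_KEYS = ("temperature", "top_p", "top_k", "repetition_penalty")


def load_generation_defaults(model_dir: str) -> dict:
    """Read the sampling defaults shipped with a locally downloaded model."""
    config_path = Path(model_dir) / "generation_config.json"
    with config_path.open(encoding="utf-8") as f:
        config = json.load(f)
    # Keep only the sampling keys that the file actually defines.
    return {key: config[key] for key in SAMPLING_KEYS if key in config}
```

Depending on the serving framework and its version, these defaults may not be applied automatically, so passing them explicitly in each request makes the comparison with the hosted demo more reliable.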

mide123 commented 2 months ago

Thank you very much for your reply. I'll give it a try

AllenLeong commented 2 months ago

I have the same issue using Qwen2-1.5B. The output keeps producing next-word predictions without stopping. It seems the model is not able to predict an [END] token. You can reproduce it in my Colab.

JacquelineXu commented 2 months ago

@AllenLeong Your model_name is Qwen2-1.5B. It's a base model trained on the next token prediction task. Use qwen/Qwen2-1.5B-Instruct instead.

AllenLeong commented 2 months ago

Thanks, thanks, it's very helpful

acupofespresso commented 2 weeks ago

See this document: https://qwen.readthedocs.io/zh-cn/latest/deployment/vllm.html# Setting the repetition penalty parameter can resolve the repeated-generation problem: repetition_penalty=1.05

If you are calling the OpenAI API, you can set frequency_penalty=1.05
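
As a rough sketch of how the repetition penalty could be passed through an OpenAI-style client: `repetition_penalty` is not a standard OpenAI parameter, but vLLM's OpenAI-compatible server accepts it as an extra request field, which the `openai` Python client forwards through `extra_body`. Note that the OpenAI `frequency_penalty` is an additive penalty in the range -2.0 to 2.0, so it is not a drop-in equivalent of the multiplicative `repetition_penalty`. The helper name, server URL, and API key below are assumptions for illustration.

```python
def build_chat_request(messages: list, repetition_penalty: float = 1.05) -> dict:
    """Build keyword arguments for openai-python's chat.completions.create()."""
    return {
        "model": "Qwen2-1.5B-Instruct",
        "messages": messages,
        "max_tokens": 1024,
        # Not a standard OpenAI field: sent as an extra JSON field that
        # vLLM's OpenAI-compatible server understands.
        "extra_body": {"repetition_penalty": repetition_penalty},
    }


# Usage against a running vLLM server (base_url and api_key are assumptions):
# from openai import OpenAI
# client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")
# response = client.chat.completions.create(**build_chat_request(messages))
```

Keeping the request construction in one function makes it easy to rerun the same prompt with and without the penalty and compare the outputs.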