return await dependant.call(*values)
File "/data/llm/Qwen/openai_api.py", line 395, in create_chat_completion
stop_words = add_extra_stop_words(request.stop)
File "/home/cis/.cache/huggingface/modules/transformers_modules/checkpoint-62-merged/modeling_qwen.py", line 1137, in chat
outputs = self.generate(
File "/home/cis/.cache/huggingface/modules/transformers_modules/checkpoint-62-merged/modeling_qwen.py", line 1259, in generate
return super().generate(
File "/home/cis/.conda/envs/llm/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return func(args, kwargs)
File "/home/cis/.conda/envs/llm/lib/python3.10/site-packages/transformers/generation/utils.py", line 1764, in generate
return self.sample(
File "/home/cis/.conda/envs/llm/lib/python3.10/site-packages/transformers/generation/utils.py", line 2897, in sample
next_tokens = torch.multinomial(probs, num_samples=1).squeeze(1)
RuntimeError: probability tensor contains either inf, nan or element < 0**
用Qwen的openai_api.py加载模型
python openai_api.py --server-name 0.0.0.0 --server-port 8000 -c /data2/swift/output/qwen-1_8b-chat/v0-20231227-150304/checkpoint-62-merged
带function的聊天
我试过只调用一次带functions的聊天,偶尔可以成功,大部分时候出以下错误
return await dependant.call(*values) File "/data/llm/Qwen/openai_api.py", line 395, in create_chat_completion stop_words = add_extra_stop_words(request.stop) File "/home/cis/.cache/huggingface/modules/transformers_modules/checkpoint-62-merged/modeling_qwen.py", line 1137, in chat outputs = self.generate( File "/home/cis/.cache/huggingface/modules/transformers_modules/checkpoint-62-merged/modeling_qwen.py", line 1259, in generate return super().generate( File "/home/cis/.conda/envs/llm/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context return func(args, kwargs) File "/home/cis/.conda/envs/llm/lib/python3.10/site-packages/transformers/generation/utils.py", line 1764, in generate return self.sample( File "/home/cis/.conda/envs/llm/lib/python3.10/site-packages/transformers/generation/utils.py", line 2897, in sample next_tokens = torch.multinomial(probs, num_samples=1).squeeze(1) RuntimeError: probability tensor contains either
inf
,nan
or element < 0**