xorbitsai / inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
https://inference.readthedocs.io
Apache License 2.0
5.34k stars 433 forks source link

Xinference Chat Bot 每次对话多轮就会卡死 #1192

Closed andylzming closed 3 months ago

andylzming commented 7 months ago

Describe the bug

Xinference Chat Bot 每次对话多轮(一般两三轮)就会卡死,详见截图。

11

12

To Reproduce

To help us to reproduce this bug, please provide information below:

  1. Python :3.10.6
  2. xinference : 0.9.4
  3. Versions of crucial packages.
  4. Full stack of the error.
  5. Minimized code to reproduce the error.

Expected behavior

A clear and concise description of what you expected to happen.

Additional context

Add any other context about the problem here.

ChengjieLi28 commented 7 months ago

@andylzming 。我用一样的模型可以复现此问题(不一定百分百,我换个模型有时不会触发),gradio 版本:

gradio                        3.50.1
gradio_client                 0.6.1

打开F12可以看到console上有报错,然后网络中的ws中其实模型的回答已经传回来了,只是gradio没显示出来。猜测gradio版本有问题。 详见:https://github.com/gradio-app/gradio/issues/6613https://github.com/gradio-app/gradio/issues/3943

按照issue里面,gradio降级到3.41,我就再也没出现这样的问题,你可以试下。

andylzming commented 7 months ago

@ChengjieLi28

gradio降级到3.41,submit 按钮点击无效。

以下两个版本都会出现对话卡死现象

(xinference) [root@gpu-server gradio]# pip list | grep gradio
gradio                        3.47.1
gradio_client                 0.6.0
(xinference) [root@gpu-server depends]# ll xinference-dependences/ | grep gradio
-rw-r--r--. 1 root root  20298198 12月 19 21:55 gradio-3.50.2-py3-none-any.whl
-rw-r--r--. 1 root root    299220 12月 19 21:55 gradio_client-0.6.1-py3-none-any.whl

控制台如下: 111

112

andylzming commented 7 months ago

qwen-14b 模型对话多轮正常,chatglm3-6b 不行。 另外,通过 dify 使用 xinference 与 chatglm3-6b 通信报以下错误:

github-actions[bot] commented 3 months ago

This issue is stale because it has been open for 7 days with no activity.

github-actions[bot] commented 3 months ago

This issue was closed because it has been inactive for 5 days since being marked as stale.