Closed jsyqrt closed 1 month ago
我也是这个错,xinference部署出现问题,请问解决了吗
seems like they deleted the function "stream_chat"
It's addressed in #1876 and will be included in next version. Feel free to reopen this issue if it does not work when new version released.
System Info / 系統信息
python --version Python 3.11.0
Running Xinference with Docker? / 是否使用 Docker 运行 Xinfernece?
Version info / 版本信息
xinference -v xinference, version 0.13.1
The command used to start Xinference / 用以启动 xinference 的命令
XINFERENCE_MODEL_SRC=modelscope xinference-local --host 0.0.0.0 --port 9997
Reproduction / 复现过程
启动模型 xinference launch --model-engine http://0.0.0.0:9997 -n glm4-chat -s 9 -f pytorch -q none -en transformers 然后做推理
xinference 服务端详细日志
Expected behavior / 期待表现
希望能正常推理,不会报错