The new version seems unable to deploy bge-m3; it fails with an error #1950

Closed: leslie2046 closed this issue 1 month ago

leslie2046 commented 1 month ago

System Info

CentOS 7.9, Python 3.10.6

The command used to start Xinference

xinference launch --model-name bge-m3 --model-type embedding -r 4 --n-gpu 2
Launch model name: bge-m3 with kwargs: {}
Traceback (most recent call last):
  File "/home/njue/anaconda3/envs/xinference/bin/xinference", line 8, in <module>
    sys.exit(cli())
  File "/home/njue/anaconda3/envs/xinference/lib/python3.10/site-packages/click/core.py", line 1157, in __call__
    return self.main(*args, **kwargs)
  File "/home/njue/anaconda3/envs/xinference/lib/python3.10/site-packages/click/core.py", line 1078, in main
    rv = self.invoke(ctx)
  File "/home/njue/anaconda3/envs/xinference/lib/python3.10/site-packages/click/core.py", line 1688, in invoke
    return _process_result(sub_ctx.command.invoke(sub_ctx))
  File "/home/njue/anaconda3/envs/xinference/lib/python3.10/site-packages/click/core.py", line 1434, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/home/njue/anaconda3/envs/xinference/lib/python3.10/site-packages/click/core.py", line 783, in invoke
    return __callback(*args, **kwargs)
  File "/home/njue/anaconda3/envs/xinference/lib/python3.10/site-packages/click/decorators.py", line 33, in new_func
    return f(get_current_context(), *args, **kwargs)
  File "/home/njue/anaconda3/envs/xinference/lib/python3.10/site-packages/xinference/deploy/cmdline.py", line 903, in model_launch
    model_uid = client.launch_model(
  File "/home/njue/anaconda3/envs/xinference/lib/python3.10/site-packages/xinference/client/restful/restful_client.py", line 1041, in launch_model
    raise RuntimeError(
RuntimeError: Failed to launch model, detail: [address=, pid=138956] Failed to import transformers.trainer because of the following error (look up to see its traceback):
cannot import name 'is_mlu_available' from 'accelerate.utils' (/home/njue/anaconda3/envs/xinference/lib/python3.10/site-packages/accelerate/utils/__init__.py)

xinference launch --model-name bge-m3 --model-type embedding -r 4 --n-gpu 2
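
The thread does not record what resolved this error. The message itself suggests that the installed accelerate package does not provide the `is_mlu_available` helper that the installed transformers package tries to import, which usually points to a version mismatch between the two. A minimal diagnostic sketch, assuming `python3` on the PATH is the interpreter of the xinference environment; `check_import` is a hypothetical helper written for this note, not part of Xinference:

```shell
# Hypothetical helper: report whether NAME can be imported from MODULE.
# $1 = module path, $2 = name to import
check_import() {
  if python3 -c "from $1 import $2" 2>/dev/null; then
    echo "ok: $2 is importable from $1"
  else
    echo "missing: $2 cannot be imported from $1"
  fi
}

# The symbol named in the traceback above.
check_import accelerate.utils is_mlu_available
```

If this reports `missing`, upgrading accelerate inside the same environment (for example with `pip install -U accelerate`) is a plausible next step, though it is an assumption and not something confirmed in this thread.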

lhs0627 commented 1 month ago


leslie2046 commented 1 month ago

@lhs0627 Yes, it is resolved.

lhs0627 commented 1 month ago

@leslie2046 When I run xinference launch --model-name bge-m3 --model-type embedding, it keeps failing with: RuntimeError: Failed to launch model, detail: [address=, pid=4556] Failed to download model 'bge-m3' after multiple retries. Do you know how to fix this?

leslie2046 commented 1 month ago

@lhs0627 That is because you are not downloading the model through a proxy. Either use a proxy, or set the environment variable HF_ENDPOINT=https://hf-mirror.com
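
A minimal sketch of the environment-variable route suggested above. It assumes a POSIX shell and that the Xinference server is started with `xinference-local` on port 9997 from the same shell; the host and port values are illustrative assumptions. The model download most likely happens in the server process, not in the `xinference launch` client, so the variable should be exported before the server starts:

```shell
# Point Hugging Face downloads at the mirror named in the reply above.
export HF_ENDPOINT=https://hf-mirror.com
echo "HF_ENDPOINT=$HF_ENDPOINT"

# Then restart the server in this shell and retry the launch.
# Shown as comments because they need a working Xinference installation;
# the host and port are assumptions, adjust them to your setup:
#   xinference-local --host 0.0.0.0 --port 9997
#   xinference launch --model-name bge-m3 --model-type embedding
```

If the server is managed by a service manager or a container, the variable would need to be set in that environment instead.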