xorbitsai / inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
https://inference.readthedocs.io
Apache License 2.0
4.75k stars 372 forks

BUG: failed to load models #496

Closed zysno1 closed 11 months ago

zysno1 commented 11 months ago

Describe the bug

I have built a Docker image myself and deployed xinference on k8s. The homepage can be accessed normally. However, loading the model failed, and the error message is 'not found'. This error is consistently reproducible, making xinference basically unusable.

To Reproduce

To help us to reproduce this bug, please provide information below:

  1. Your Python version. 3.10
  2. The version of xinference you use. latest
  3. Versions of crucial packages.
  4. Full stack of the error.
  5. Minimized code to reproduce the error.

Expected behavior

A clear and concise description of what you expected to happen.

Additional context

dockerfile

FROM python:3.10

ENV XINFERENCE_MODEL_SRC=xorbits

RUN pip install --upgrade pip setuptools -i https://pypi.tuna.tsinghua.edu.cn/simple/

RUN pip install "xinference[all]" -i https://pypi.tuna.tsinghua.edu.cn/simple/

EXPOSE 9997

CMD ["/usr/local/bin/xinference","-H 0.0.0.0", "-p 9997"]

(Three screenshots attached.)

UranusSeven commented 11 months ago

Hi, thanks for reporting this issue!

The error message "Not found" on the webpage indicates that something went wrong while loading the model. I see you have attached part of the server-side logs. Could you attach the full logs?

UranusSeven commented 11 months ago

The problem seems to be related to XINFERENCE_MODEL_SRC=xorbits. This env var forces xinference to download models from our S3 bucket, which is not accessible right now.

We've added support for downloading from ModelScope, and you can enable this feature by setting XINFERENCE_MODEL_SRC=modelscope.
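
As a rough illustration of the suggestion above, the following Python sketch prepares an environment with `XINFERENCE_MODEL_SRC=modelscope` and a launch command. The `-H` and `-p` flags and port 9997 are taken from the Dockerfile earlier in this issue. The helper name `build_launch` is hypothetical and is not part of xinference.

```python
import os
import subprocess


def build_launch(base_env, host="0.0.0.0", port=9997):
    """Return (command, environment) for starting the server with
    ModelScope selected as the model source.

    Hypothetical helper for illustration; not an xinference API.
    """
    env = dict(base_env)  # copy so the caller's environment is not mutated
    env["XINFERENCE_MODEL_SRC"] = "modelscope"
    # Each flag and its value is a separate item in the argument list.
    cmd = ["xinference", "-H", host, "-p", str(port)]
    return cmd, env


if __name__ == "__main__":
    cmd, env = build_launch(os.environ)
    print("would run:", cmd)
    print("XINFERENCE_MODEL_SRC =", env["XINFERENCE_MODEL_SRC"])
    # Uncomment to actually start the server (assumes xinference is installed):
    # subprocess.run(cmd, env=env, check=True)
```

In a container, the same effect would presumably come from changing the `ENV XINFERENCE_MODEL_SRC=...` line in the Dockerfile rather than wrapping the command in Python.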

Bojun-Feng commented 11 months ago

I also encountered a similar issue, using Python 3.11.4 and the newest version of Xinference from GitHub. (Screenshot attached.)

I would recommend adding the argument --log-level DEBUG to see what is going on when launching the model. In my case, the issue originates from launching the model in the Supervisor, before the Gradio page is built.

The core issue seems to be in the launch model part of the code:

  File "/Users/bojunfeng/cs/inference/xinference/core/utils.py", line 27, in wrapped
    ret = await func(*args, **kwargs)
          ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/bojunfeng/cs/inference/xinference/core/worker.py", line 182, in launch_builtin_model
    subpool_address, devices = await self._create_subpool(model_uid, n_gpu=n_gpu)
                               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/bojunfeng/cs/inference/xinference/core/worker.py", line 111, in _create_subpool
    sub_pool_address = await self._main_pool.append_sub_pool(
                             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
AttributeError: 'MainActorPool' object has no attribute 'append_sub_pool'

Since the model is never launched, the later request to create a Gradio page for it also fails with "Model not found in the model list". The result is a "model not found" error page instead of the expected Gradio interface.
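
The failure chain described above can be mimicked with a toy model. This is only a sketch: `ToySupervisor` and `OldPool` are hypothetical stand-ins, not xinference or xoscar classes. It shows how a missing method during launch leaves the model unregistered, so a later lookup fails with a different error.

```python
class OldPool:
    """Stand-in for a pool from an outdated dependency.

    It deliberately lacks an append_sub_pool method.
    """


class ToySupervisor:
    """Hypothetical, simplified supervisor; not xinference's real class."""

    def __init__(self, pool):
        self._pool = pool
        self._models = {}

    def launch(self, model_uid):
        # Raises AttributeError before the model is ever registered.
        address = self._pool.append_sub_pool()
        self._models[model_uid] = address
        return model_uid

    def describe(self, model_uid):
        if model_uid not in self._models:
            raise ValueError(f"Model not found in the model list, uid: {model_uid}")
        return {"uid": model_uid, "address": self._models[model_uid]}


def reproduce(uid="demo-uid"):
    """Run launch then describe, collecting the exception type names."""
    supervisor = ToySupervisor(OldPool())
    errors = []
    try:
        supervisor.launch(uid)
    except AttributeError as exc:
        errors.append(type(exc).__name__)
    try:
        supervisor.describe(uid)
    except ValueError as exc:
        errors.append(type(exc).__name__)
    return errors


if __name__ == "__main__":
    print(reproduce())
```

The second error is only a symptom. The first one, the `AttributeError`, is the one worth chasing.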

Complete Log from DEBUG mode since Pressing the Button

```
2023-09-28 09:19:53,235 xinference.core.supervisor 62053 DEBUG Enter launch_builtin_model, model_uid: 1783edb0-5e0a-11ee-8b3d-af674e7ac1bf, model_name: gpt-2, model_size: 1, model_format: ggmlv3, quantization: none, replica: 1
2023-09-28 09:19:53,236 xinference.core.worker 62053 DEBUG Enter get_model_count, args: (,), kwargs: {}
2023-09-28 09:19:53,236 xinference.core.worker 62053 DEBUG Leave get_model_count, elapsed time: 0 ms
2023-09-28 09:19:53,236 xinference.core.worker 62053 DEBUG Enter launch_builtin_model, args: (,), kwargs: {'model_uid': '1783edb0-5e0a-11ee-8b3d-af674e7ac1bf-1-0', 'model_name': 'gpt-2', 'model_size_in_billions': 1, 'model_format': 'ggmlv3', 'quantization': 'none', 'model_type': 'LLM', 'n_gpu': 'auto'}
2023-09-28 09:19:53,237 xinference.core.supervisor 62053 DEBUG Enter is_local_deployment, args: (,), kwargs: {}
2023-09-28 09:19:53,237 xinference.core.supervisor 62053 DEBUG Leave is_local_deployment, elapsed time: 0 ms
2023-09-28 09:19:53,245 xinference.model.llm.llm_family 62053 INFO Caching from Hugging Face: marella/gpt-2-ggml
2023-09-28 09:19:53,296 urllib3.connectionpool 62053 DEBUG Starting new HTTPS connection (1): huggingface.co:443
2023-09-28 09:19:53,357 urllib3.connectionpool 62053 DEBUG https://huggingface.co:443 "HEAD /marella/gpt-2-ggml/resolve/main/ggml-model.bin HTTP/1.1" 302 0
2023-09-28 09:19:53,381 xinference.model.llm.core 62053 DEBUG Launching 1783edb0-5e0a-11ee-8b3d-af674e7ac1bf-1-0 with CtransformersModel
2023-09-28 09:19:53,386 xinference.core.supervisor 62053 DEBUG Enter terminate_model, args: (, '1783edb0-5e0a-11ee-8b3d-af674e7ac1bf'), kwargs: {'suppress_exception': True}
2023-09-28 09:19:53,386 xinference.core.supervisor 62053 DEBUG Leave terminate_model, elapsed time: 0 ms
2023-09-28 09:19:53,386 xinference.core.restful_api 62053 ERROR 'MainActorPool' object has no attribute 'append_sub_pool'
Traceback (most recent call last):
  File "/Users/bojunfeng/cs/inference/xinference/core/restful_api.py", line 408, in launch_model
    model_uid = await self._supervisor_ref.launch_builtin_model(
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "xoscar/core.pyx", line 288, in __pyx_actor_method_wrapper
  File "xoscar/core.pyx", line 422, in _handle_actor_result
  File "xoscar/core.pyx", line 465, in _run_actor_async_generator
  File "xoscar/core.pyx", line 466, in xoscar.core._BaseActor._run_actor_async_generator
  File "xoscar/core.pyx", line 471, in xoscar.core._BaseActor._run_actor_async_generator
  File "/Users/bojunfeng/cs/inference/xinference/core/supervisor.py", line 227, in launch_builtin_model
    yield _launch_one_model(rep_model_uid)
  File "xoscar/core.pyx", line 476, in xoscar.core._BaseActor._run_actor_async_generator
  File "xoscar/core.pyx", line 422, in _handle_actor_result
  File "xoscar/core.pyx", line 465, in _run_actor_async_generator
  File "xoscar/core.pyx", line 466, in xoscar.core._BaseActor._run_actor_async_generator
  File "xoscar/core.pyx", line 471, in xoscar.core._BaseActor._run_actor_async_generator
  File "/Users/bojunfeng/cs/inference/xinference/core/supervisor.py", line 206, in _launch_one_model
    yield worker_ref.launch_builtin_model(
  File "xoscar/core.pyx", line 476, in xoscar.core._BaseActor._run_actor_async_generator
  File "xoscar/core.pyx", line 396, in _handle_actor_result
  File "xoscar/core.pyx", line 284, in __pyx_actor_method_wrapper
  File "xoscar/core.pyx", line 287, in xoscar.core.__pyx_actor_method_wrapper
  File "/Users/bojunfeng/cs/inference/xinference/core/utils.py", line 27, in wrapped
    ret = await func(*args, **kwargs)
          ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/bojunfeng/cs/inference/xinference/core/worker.py", line 182, in launch_builtin_model
    subpool_address, devices = await self._create_subpool(model_uid, n_gpu=n_gpu)
                               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/bojunfeng/cs/inference/xinference/core/worker.py", line 111, in _create_subpool
    sub_pool_address = await self._main_pool.append_sub_pool(
                             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
AttributeError: 'MainActorPool' object has no attribute 'append_sub_pool'
2023-09-28 09:19:53,442 urllib3.connectionpool 62053 DEBUG Starting new HTTP connection (1): 127.0.0.1:9997
2023-09-28 09:19:53,443 xinference.core.supervisor 62053 DEBUG Enter describe_model, args: (, '1783edb0-5e0a-11ee-8b3d-af674e7ac1bf'), kwargs: {}
2023-09-28 09:19:53,443 xinference.core.restful_api 62053 ERROR Model not found in the model list, uid: 1783edb0-5e0a-11ee-8b3d-af674e7ac1bf
Traceback (most recent call last):
  File "/Users/bojunfeng/cs/inference/xinference/core/restful_api.py", line 365, in describe_model
    return await self._supervisor_ref.describe_model(model_uid)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "xoscar/core.pyx", line 284, in __pyx_actor_method_wrapper
  File "xoscar/core.pyx", line 287, in xoscar.core.__pyx_actor_method_wrapper
  File "/Users/bojunfeng/cs/inference/xinference/core/utils.py", line 27, in wrapped
    ret = await func(*args, **kwargs)
          ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/bojunfeng/cs/inference/xinference/core/supervisor.py", line 300, in describe_model
    raise ValueError(f"Model not found in the model list, uid: {model_uid}")
ValueError: Model not found in the model list, uid: 1783edb0-5e0a-11ee-8b3d-af674e7ac1bf
2023-09-28 09:19:53,443 urllib3.connectionpool 62053 DEBUG http://127.0.0.1:9997 "GET /v1/models/1783edb0-5e0a-11ee-8b3d-af674e7ac1bf HTTP/1.1" 400 89
2023-09-28 09:19:53,443 xinference.core.restful_api 62053 ERROR Failed to get the model description, detail: Model not found in the model list, uid: 1783edb0-5e0a-11ee-8b3d-af674e7ac1bf
Traceback (most recent call last):
  File "/Users/bojunfeng/cs/inference/xinference/core/restful_api.py", line 457, in build_interface
    gr.mount_gradio_app(self._app, interface.build(), f"/{model_uid}")
                                   ^^^^^^^^^^^^^^^^^
  File "/Users/bojunfeng/cs/inference/xinference/core/chat_interface.py", line 36, in build
    model = self.client.get_model(self.model_uid)
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/bojunfeng/cs/inference/xinference/client.py", line 877, in get_model
    raise RuntimeError(
RuntimeError: Failed to get the model description, detail: Model not found in the model list, uid: 1783edb0-5e0a-11ee-8b3d-af674e7ac1bf
```

Bojun-Feng commented 11 months ago

I have managed to solve this issue on my end. I am not sure whether the root cause is exactly the same, but I hope this information is helpful.

TLDR: some dependency packages are not up to date, so reinstall Xinference with all dependencies.

I noticed that the MainActorPool object is imported from another Python package, Xoscar, so if MainActorPool is causing the problem, it is probably not Xinference's fault.

Took a look at setup.cfg:

install_requires = 
    xoscar>=0.1.2

Running pip show xoscar:

Name: xoscar
Version: 0.1.0
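
The mismatch above (installed 0.1.0 against a required `xoscar>=0.1.2`) can also be checked from Python. The sketch below uses only the standard library. The helper names are hypothetical, and the version parser is deliberately naive: it handles plain dotted integers with the same number of components, not pre-release or post-release suffixes.

```python
from importlib import metadata


def parse_version(text):
    """Turn '0.1.2' into (0, 1, 2). Plain dotted integers only."""
    return tuple(int(part) for part in text.split("."))


def meets_minimum(installed, minimum):
    """Compare two dotted version strings component by component."""
    return parse_version(installed) >= parse_version(minimum)


def check_installed(package, minimum):
    """Return True or False for an installed package, or None if it is absent."""
    try:
        installed = metadata.version(package)
    except metadata.PackageNotFoundError:
        return None
    return meets_minimum(installed, minimum)


if __name__ == "__main__":
    # Values from this comment: installed 0.1.0, required >=0.1.2.
    print("0.1.0 satisfies >=0.1.2:", meets_minimum("0.1.0", "0.1.2"))
    print("local xoscar check:", check_installed("xoscar", "0.1.2"))
```

For real projects, a proper version parser (for example, the one in the third-party `packaging` library) is a safer choice than this tuple comparison.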

Steps I took to resolve the problem:

Full log updating Xinference with all dependencies ``` (base) bojunfeng@Bojuns-MacBook-Pro inference % pip install -e ".[all]" Obtaining file:///Users/bojunfeng/cs/inference Installing build dependencies ... done Checking if build backend supports build_editable ... done Getting requirements to build wheel ... done Installing backend dependencies ... done Preparing metadata (pyproject.toml) ... done Collecting xoscar>=0.1.2 (from xinference==0+untagged.256.g3a5082f) Obtaining dependency information for xoscar>=0.1.2 from https://files.pythonhosted.org/packages/98/b1/21f85cf42e396e95c57c9fcc2864538609fe07b9b09e9ee0d86bffa8c539/xoscar-0.1.2-cp311-cp311-macosx_10_9_universal2.whl.metadata Downloading xoscar-0.1.2-cp311-cp311-macosx_10_9_universal2.whl.metadata (9.2 kB) Requirement already satisfied: xorbits in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (0.5.2) Requirement already satisfied: gradio>=3.39.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (3.41.2) Requirement already satisfied: click in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (8.0.4) Requirement already satisfied: tqdm>=4.27 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (4.65.0) Requirement already satisfied: tabulate in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (0.8.10) Requirement already satisfied: requests in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (2.31.0) Requirement already satisfied: pydantic<2 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (1.10.12) Requirement already satisfied: fastapi in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (0.103.0) 
Requirement already satisfied: uvicorn in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (0.23.2) Requirement already satisfied: huggingface-hub<1.0,>=0.14.1 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (0.15.1) Requirement already satisfied: typing-extensions in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (4.7.1) Requirement already satisfied: fsspec in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (2023.4.0) Requirement already satisfied: s3fs in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (2023.4.0) Collecting modelscope (from xinference==0+untagged.256.g3a5082f) Obtaining dependency information for modelscope from https://files.pythonhosted.org/packages/45/da/b5f822b87870c1a4d413a9e0e55f97183a4b8e5d1ea49503f04ec84ef927/modelscope-1.9.1-py3-none-any.whl.metadata Downloading modelscope-1.9.1-py3-none-any.whl.metadata (33 kB) Requirement already satisfied: ctransformers in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (0.2.26) Collecting llama-cpp-python>=0.2.0 (from xinference==0+untagged.256.g3a5082f) Downloading llama_cpp_python-0.2.7.tar.gz (1.6 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.6/1.6 MB 9.9 MB/s eta 0:00:00 Installing build dependencies ... done Getting requirements to build wheel ... done Preparing metadata (pyproject.toml) ... 
done Requirement already satisfied: transformers>=4.31.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (4.32.1) Requirement already satisfied: torch in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (2.0.1) Requirement already satisfied: accelerate>=0.20.3 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (0.22.0) Requirement already satisfied: sentencepiece in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (0.1.99) Requirement already satisfied: transformers-stream-generator in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (0.0.4) Requirement already satisfied: bitsandbytes in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (0.41.1) Requirement already satisfied: protobuf in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (4.24.2) Requirement already satisfied: einops in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (0.6.1) Requirement already satisfied: tiktoken in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xinference==0+untagged.256.g3a5082f) (0.4.0) Collecting sentence-transformers (from xinference==0+untagged.256.g3a5082f) Downloading sentence-transformers-2.2.2.tar.gz (85 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 86.0/86.0 kB 4.8 MB/s eta 0:00:00 Preparing metadata (setup.py) ... 
done Requirement already satisfied: numpy>=1.17 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from accelerate>=0.20.3->xinference==0+untagged.256.g3a5082f) (1.24.3) Requirement already satisfied: packaging>=20.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from accelerate>=0.20.3->xinference==0+untagged.256.g3a5082f) (23.0) Requirement already satisfied: psutil in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from accelerate>=0.20.3->xinference==0+untagged.256.g3a5082f) (5.9.0) Requirement already satisfied: pyyaml in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from accelerate>=0.20.3->xinference==0+untagged.256.g3a5082f) (6.0) Requirement already satisfied: aiofiles<24.0,>=22.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (22.1.0) Requirement already satisfied: altair<6.0,>=4.2.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (5.1.1) Requirement already satisfied: ffmpy in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (0.3.1) Requirement already satisfied: gradio-client==0.5.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (0.5.0) Requirement already satisfied: httpx in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (0.24.1) Requirement already satisfied: importlib-resources<7.0,>=1.3 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (6.0.1) Requirement already satisfied: jinja2<4.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (3.1.2) Requirement already satisfied: markupsafe~=2.0 in 
/Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (2.1.1) Requirement already satisfied: matplotlib~=3.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (3.7.1) Requirement already satisfied: orjson~=3.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (3.9.5) Requirement already satisfied: pandas<3.0,>=1.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (1.5.3) Requirement already satisfied: pillow<11.0,>=8.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (9.4.0) Requirement already satisfied: pydub in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (0.25.1) Requirement already satisfied: python-multipart in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (0.0.6) Requirement already satisfied: semantic-version~=2.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (2.10.0) Requirement already satisfied: websockets<12.0,>=10.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (11.0.3) Requirement already satisfied: filelock in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from huggingface-hub<1.0,>=0.14.1->xinference==0+untagged.256.g3a5082f) (3.9.0) Requirement already satisfied: diskcache>=5.6.1 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from llama-cpp-python>=0.2.0->xinference==0+untagged.256.g3a5082f) (5.6.3) Requirement already satisfied: charset-normalizer<4,>=2 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from 
requests->xinference==0+untagged.256.g3a5082f) (2.0.4) Requirement already satisfied: idna<4,>=2.5 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from requests->xinference==0+untagged.256.g3a5082f) (3.4) Requirement already satisfied: urllib3<3,>=1.21.1 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from requests->xinference==0+untagged.256.g3a5082f) (1.26.16) Requirement already satisfied: certifi>=2017.4.17 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from requests->xinference==0+untagged.256.g3a5082f) (2023.7.22) Requirement already satisfied: sympy in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from torch->xinference==0+untagged.256.g3a5082f) (1.11.1) Requirement already satisfied: networkx in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from torch->xinference==0+untagged.256.g3a5082f) (3.1) Requirement already satisfied: regex!=2019.12.17 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from transformers>=4.31.0->xinference==0+untagged.256.g3a5082f) (2022.7.9) Requirement already satisfied: tokenizers!=0.11.3,<0.14,>=0.11.1 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from transformers>=4.31.0->xinference==0+untagged.256.g3a5082f) (0.13.2) Requirement already satisfied: safetensors>=0.3.1 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from transformers>=4.31.0->xinference==0+untagged.256.g3a5082f) (0.3.3) Requirement already satisfied: h11>=0.8 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from uvicorn->xinference==0+untagged.256.g3a5082f) (0.14.0) Requirement already satisfied: cloudpickle>=1.5.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xoscar>=0.1.2->xinference==0+untagged.256.g3a5082f) (2.2.1) Requirement already satisfied: tblib>=1.7.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xoscar>=0.1.2->xinference==0+untagged.256.g3a5082f) (1.7.0) Requirement already satisfied: uvloop>=0.14.0 in 
/Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xoscar>=0.1.2->xinference==0+untagged.256.g3a5082f) (0.17.0) Requirement already satisfied: scipy>=1.0.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xoscar>=0.1.2->xinference==0+untagged.256.g3a5082f) (1.10.1) Requirement already satisfied: py-cpuinfo<10.0.0,>=9.0.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from ctransformers->xinference==0+untagged.256.g3a5082f) (9.0.0) Requirement already satisfied: starlette<0.28.0,>=0.27.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from fastapi->xinference==0+untagged.256.g3a5082f) (0.27.0) Collecting addict (from modelscope->xinference==0+untagged.256.g3a5082f) Downloading addict-2.4.0-py3-none-any.whl (3.8 kB) Requirement already satisfied: attrs in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from modelscope->xinference==0+untagged.256.g3a5082f) (22.1.0) Requirement already satisfied: datasets<=2.13.0,>=2.8.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from modelscope->xinference==0+untagged.256.g3a5082f) (2.12.0) Collecting gast>=0.2.2 (from modelscope->xinference==0+untagged.256.g3a5082f) Downloading gast-0.5.4-py3-none-any.whl (19 kB) Collecting ms-swift (from modelscope->xinference==0+untagged.256.g3a5082f) Obtaining dependency information for ms-swift from https://files.pythonhosted.org/packages/4e/ea/625a61ec5d3006b6f1d6c4ecdbc1cb6c28d16c4e1234edaa46d7d63406f8/ms_swift-1.1.0-py3-none-any.whl.metadata Downloading ms_swift-1.1.0-py3-none-any.whl.metadata (12 kB) Collecting oss2 (from modelscope->xinference==0+untagged.256.g3a5082f) Downloading oss2-2.18.2.tar.gz (273 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 273.1/273.1 kB 8.9 MB/s eta 0:00:00 Preparing metadata (setup.py) ... 
done Requirement already satisfied: pyarrow!=9.0.0,>=6.0.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from modelscope->xinference==0+untagged.256.g3a5082f) (11.0.0) Requirement already satisfied: python-dateutil>=2.1 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from modelscope->xinference==0+untagged.256.g3a5082f) (2.8.2) Requirement already satisfied: setuptools in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from modelscope->xinference==0+untagged.256.g3a5082f) (68.0.0) Collecting simplejson>=3.3.0 (from modelscope->xinference==0+untagged.256.g3a5082f) Downloading simplejson-3.19.1-cp311-cp311-macosx_11_0_arm64.whl (74 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 74.5/74.5 kB 6.0 MB/s eta 0:00:00 Requirement already satisfied: sortedcontainers>=1.5.9 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from modelscope->xinference==0+untagged.256.g3a5082f) (2.4.0) Requirement already satisfied: yapf in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from modelscope->xinference==0+untagged.256.g3a5082f) (0.31.0) Requirement already satisfied: aiobotocore~=2.5.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from s3fs->xinference==0+untagged.256.g3a5082f) (2.5.0) Requirement already satisfied: aiohttp!=4.0.0a0,!=4.0.0a1 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from s3fs->xinference==0+untagged.256.g3a5082f) (3.8.3) Collecting torchvision (from sentence-transformers->xinference==0+untagged.256.g3a5082f) Downloading torchvision-0.15.2-cp311-cp311-macosx_11_0_arm64.whl (1.4 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.4/1.4 MB 10.6 MB/s eta 0:00:00 Requirement already satisfied: scikit-learn in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from sentence-transformers->xinference==0+untagged.256.g3a5082f) (1.3.0) Requirement already satisfied: nltk in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from 
sentence-transformers->xinference==0+untagged.256.g3a5082f) (3.8.1) Requirement already satisfied: tornado>=6.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xorbits->xinference==0+untagged.256.g3a5082f) (6.3.2) Requirement already satisfied: sqlalchemy>=1.2.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xorbits->xinference==0+untagged.256.g3a5082f) (1.4.39) Requirement already satisfied: defusedxml>=0.5.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from xorbits->xinference==0+untagged.256.g3a5082f) (0.7.1) Requirement already satisfied: botocore<1.29.77,>=1.29.76 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from aiobotocore~=2.5.0->s3fs->xinference==0+untagged.256.g3a5082f) (1.29.76) Requirement already satisfied: wrapt>=1.10.10 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from aiobotocore~=2.5.0->s3fs->xinference==0+untagged.256.g3a5082f) (1.14.1) Requirement already satisfied: aioitertools>=0.5.1 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from aiobotocore~=2.5.0->s3fs->xinference==0+untagged.256.g3a5082f) (0.7.1) Requirement already satisfied: multidict<7.0,>=4.5 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from aiohttp!=4.0.0a0,!=4.0.0a1->s3fs->xinference==0+untagged.256.g3a5082f) (6.0.2) Requirement already satisfied: async-timeout<5.0,>=4.0.0a3 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from aiohttp!=4.0.0a0,!=4.0.0a1->s3fs->xinference==0+untagged.256.g3a5082f) (4.0.2) Requirement already satisfied: yarl<2.0,>=1.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from aiohttp!=4.0.0a0,!=4.0.0a1->s3fs->xinference==0+untagged.256.g3a5082f) (1.8.1) Requirement already satisfied: frozenlist>=1.1.1 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from aiohttp!=4.0.0a0,!=4.0.0a1->s3fs->xinference==0+untagged.256.g3a5082f) (1.3.3) Requirement already satisfied: aiosignal>=1.1.2 in 
/Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from aiohttp!=4.0.0a0,!=4.0.0a1->s3fs->xinference==0+untagged.256.g3a5082f) (1.2.0) Requirement already satisfied: jsonschema>=3.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from altair<6.0,>=4.2.0->gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (4.17.3) Requirement already satisfied: toolz in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from altair<6.0,>=4.2.0->gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (0.12.0) Requirement already satisfied: dill<0.3.7,>=0.3.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from datasets<=2.13.0,>=2.8.0->modelscope->xinference==0+untagged.256.g3a5082f) (0.3.6) Requirement already satisfied: xxhash in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from datasets<=2.13.0,>=2.8.0->modelscope->xinference==0+untagged.256.g3a5082f) (2.0.2) Requirement already satisfied: multiprocess in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from datasets<=2.13.0,>=2.8.0->modelscope->xinference==0+untagged.256.g3a5082f) (0.70.14) Requirement already satisfied: responses<0.19 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from datasets<=2.13.0,>=2.8.0->modelscope->xinference==0+untagged.256.g3a5082f) (0.13.3) Requirement already satisfied: contourpy>=1.0.1 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from matplotlib~=3.0->gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (1.0.5) Requirement already satisfied: cycler>=0.10 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from matplotlib~=3.0->gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (0.11.0) Requirement already satisfied: fonttools>=4.22.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from matplotlib~=3.0->gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (4.25.0) Requirement already satisfied: kiwisolver>=1.0.1 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from 
matplotlib~=3.0->gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (1.4.4) Requirement already satisfied: pyparsing>=2.3.1 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from matplotlib~=3.0->gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (3.0.9) Requirement already satisfied: pytz>=2020.1 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from pandas<3.0,>=1.0->gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (2022.7) Requirement already satisfied: six>=1.5 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from python-dateutil>=2.1->modelscope->xinference==0+untagged.256.g3a5082f) (1.16.0) Requirement already satisfied: joblib>=1.1.1 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from scikit-learn->sentence-transformers->xinference==0+untagged.256.g3a5082f) (1.2.0) Requirement already satisfied: threadpoolctl>=2.0.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from scikit-learn->sentence-transformers->xinference==0+untagged.256.g3a5082f) (2.2.0) Requirement already satisfied: anyio<5,>=3.4.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from starlette<0.28.0,>=0.27.0->fastapi->xinference==0+untagged.256.g3a5082f) (3.5.0) Requirement already satisfied: httpcore<0.18.0,>=0.15.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from httpx->gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (0.17.3) Requirement already satisfied: sniffio in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from httpx->gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (1.2.0) Collecting diffusers>=0.18.0 (from ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) Obtaining dependency information for diffusers>=0.18.0 from https://files.pythonhosted.org/packages/6c/73/4a4a0961d2e21ebd869556d4cb334697cd3ab9cb9f13f24b1f1f2ca1e3ee/diffusers-0.21.3-py3-none-any.whl.metadata Downloading diffusers-0.21.3-py3-none-any.whl.metadata (18 kB) Collecting jieba (from 
ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) Downloading jieba-0.42.1.tar.gz (19.2 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 19.2/19.2 MB 13.8 MB/s eta 0:00:00 Preparing metadata (setup.py) ... done Collecting loralib>=0.1.2 (from ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) Obtaining dependency information for loralib>=0.1.2 from https://files.pythonhosted.org/packages/f1/e7/a4362bf791bca17d2d91e7c69483185ab03d5aa05dd10391eff2e179a685/loralib-0.1.2-py3-none-any.whl.metadata Downloading loralib-0.1.2-py3-none-any.whl.metadata (15 kB) Collecting peft>=0.5.0 (from ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) Obtaining dependency information for peft>=0.5.0 from https://files.pythonhosted.org/packages/37/1a/8d20e8704da9fa070eb909265584b960da57be1d833d550c59f50906dc5c/peft-0.5.0-py3-none-any.whl.metadata Downloading peft-0.5.0-py3-none-any.whl.metadata (22 kB) Collecting rouge (from ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) Downloading rouge-1.0.1-py3-none-any.whl (13 kB) Collecting tensorboard (from ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) Obtaining dependency information for tensorboard from https://files.pythonhosted.org/packages/73/a2/66ed644f6ed1562e0285fcd959af17670ea313c8f331c46f79ee77187eb9/tensorboard-2.14.1-py3-none-any.whl.metadata Downloading tensorboard-2.14.1-py3-none-any.whl.metadata (1.7 kB) Collecting crcmod>=1.7 (from oss2->modelscope->xinference==0+untagged.256.g3a5082f) Downloading crcmod-1.7.tar.gz (89 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 89.7/89.7 kB 8.0 MB/s eta 0:00:00 Preparing metadata (setup.py) ... 
done Collecting pycryptodome>=3.4.7 (from oss2->modelscope->xinference==0+untagged.256.g3a5082f) Obtaining dependency information for pycryptodome>=3.4.7 from https://files.pythonhosted.org/packages/d6/df/295e56dca0b4834665626326359c5a1f3092287be56db4b316926df7ad0c/pycryptodome-3.19.0-cp35-abi3-macosx_10_9_universal2.whl.metadata Downloading pycryptodome-3.19.0-cp35-abi3-macosx_10_9_universal2.whl.metadata (3.4 kB) Collecting aliyun-python-sdk-kms>=2.4.1 (from oss2->modelscope->xinference==0+untagged.256.g3a5082f) Obtaining dependency information for aliyun-python-sdk-kms>=2.4.1 from https://files.pythonhosted.org/packages/3d/ea/d88e08bfc4a0aee0111f1f24c98b19107bc6783441e7e944907c77b2243d/aliyun_python_sdk_kms-2.16.2-py2.py3-none-any.whl.metadata Downloading aliyun_python_sdk_kms-2.16.2-py2.py3-none-any.whl.metadata (1.5 kB) Collecting aliyun-python-sdk-core>=2.13.12 (from oss2->modelscope->xinference==0+untagged.256.g3a5082f) Downloading aliyun-python-sdk-core-2.14.0.tar.gz (443 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 443.0/443.0 kB 12.2 MB/s eta 0:00:00 Preparing metadata (setup.py) ... 
done Requirement already satisfied: mpmath>=0.19 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from sympy->torch->xinference==0+untagged.256.g3a5082f) (1.3.0) Requirement already satisfied: jmespath<1.0.0,>=0.9.3 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from aliyun-python-sdk-core>=2.13.12->oss2->modelscope->xinference==0+untagged.256.g3a5082f) (0.10.0) Requirement already satisfied: cryptography>=2.6.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from aliyun-python-sdk-core>=2.13.12->oss2->modelscope->xinference==0+untagged.256.g3a5082f) (41.0.2) Requirement already satisfied: importlib-metadata in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from diffusers>=0.18.0->ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) (6.0.0) Requirement already satisfied: pyrsistent!=0.17.0,!=0.17.1,!=0.17.2,>=0.14.0 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from jsonschema>=3.0->altair<6.0,>=4.2.0->gradio>=3.39.0->xinference==0+untagged.256.g3a5082f) (0.18.0) Collecting absl-py>=0.4 (from tensorboard->ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) Obtaining dependency information for absl-py>=0.4 from https://files.pythonhosted.org/packages/01/e4/dc0a1dcc4e74e08d7abedab278c795eef54a224363bb18f5692f416d834f/absl_py-2.0.0-py3-none-any.whl.metadata Downloading absl_py-2.0.0-py3-none-any.whl.metadata (2.3 kB) Collecting grpcio>=1.48.2 (from tensorboard->ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) Obtaining dependency information for grpcio>=1.48.2 from https://files.pythonhosted.org/packages/a1/9c/ef89aae6948949a891a50e19bb951aac2f7ceb9561fdfdcd07c9b890ed6c/grpcio-1.58.0-cp311-cp311-macosx_10_10_universal2.whl.metadata Downloading grpcio-1.58.0-cp311-cp311-macosx_10_10_universal2.whl.metadata (4.0 kB) Collecting google-auth<3,>=1.6.3 (from tensorboard->ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) Obtaining dependency information for google-auth<3,>=1.6.3 from 
https://files.pythonhosted.org/packages/23/e4/abbb8763fdf6279c471443251b3f847ee9a172d1776742b266fe6de7ac86/google_auth-2.23.1-py2.py3-none-any.whl.metadata Downloading google_auth-2.23.1-py2.py3-none-any.whl.metadata (4.2 kB) Collecting google-auth-oauthlib<1.1,>=0.5 (from tensorboard->ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) Downloading google_auth_oauthlib-1.0.0-py2.py3-none-any.whl (18 kB) Requirement already satisfied: markdown>=2.6.8 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from tensorboard->ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) (3.4.1) Collecting tensorboard-data-server<0.8.0,>=0.7.0 (from tensorboard->ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) Obtaining dependency information for tensorboard-data-server<0.8.0,>=0.7.0 from https://files.pythonhosted.org/packages/da/61/6e9ff8258422d287eec718872fb71e05324356722ab658c8afda25f51539/tensorboard_data_server-0.7.1-py3-none-any.whl.metadata Downloading tensorboard_data_server-0.7.1-py3-none-any.whl.metadata (1.1 kB) Requirement already satisfied: werkzeug>=1.0.1 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from tensorboard->ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) (2.2.3) Requirement already satisfied: cffi>=1.12 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from cryptography>=2.6.0->aliyun-python-sdk-core>=2.13.12->oss2->modelscope->xinference==0+untagged.256.g3a5082f) (1.15.1) Collecting cachetools<6.0,>=2.0.0 (from google-auth<3,>=1.6.3->tensorboard->ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) Obtaining dependency information for cachetools<6.0,>=2.0.0 from https://files.pythonhosted.org/packages/a9/c9/c8a7710f2cedcb1db9224fdd4d8307c9e48cbddc46c18b515fefc0f1abbe/cachetools-5.3.1-py3-none-any.whl.metadata Downloading cachetools-5.3.1-py3-none-any.whl.metadata (5.2 kB) Requirement already satisfied: pyasn1-modules>=0.2.1 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages 
(from google-auth<3,>=1.6.3->tensorboard->ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) (0.2.8) Collecting rsa<5,>=3.1.4 (from google-auth<3,>=1.6.3->tensorboard->ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) Downloading rsa-4.9-py3-none-any.whl (34 kB) INFO: pip is looking at multiple versions of google-auth to determine which version is compatible with other requirements. This could take a while. Collecting google-auth<3,>=1.6.3 (from tensorboard->ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) Obtaining dependency information for google-auth<3,>=1.6.3 from https://files.pythonhosted.org/packages/9d/44/5a992cb9d7bf8aaae73bc5adaf721ad08731c9d00c1c17999a8691404b0c/google_auth-2.23.0-py2.py3-none-any.whl.metadata Downloading google_auth-2.23.0-py2.py3-none-any.whl.metadata (4.2 kB) Collecting requests-oauthlib>=0.7.0 (from google-auth-oauthlib<1.1,>=0.5->tensorboard->ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) Downloading requests_oauthlib-1.3.1-py2.py3-none-any.whl (23 kB) Requirement already satisfied: zipp>=0.5 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from importlib-metadata->diffusers>=0.18.0->ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) (3.11.0) Requirement already satisfied: pycparser in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from cffi>=1.12->cryptography>=2.6.0->aliyun-python-sdk-core>=2.13.12->oss2->modelscope->xinference==0+untagged.256.g3a5082f) (2.21) Requirement already satisfied: pyasn1<0.5.0,>=0.4.6 in /Users/bojunfeng/anaconda3/lib/python3.11/site-packages (from pyasn1-modules>=0.2.1->google-auth<3,>=1.6.3->tensorboard->ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) (0.4.8) Collecting oauthlib>=3.0.0 (from requests-oauthlib>=0.7.0->google-auth-oauthlib<1.1,>=0.5->tensorboard->ms-swift->modelscope->xinference==0+untagged.256.g3a5082f) Downloading oauthlib-3.2.2-py3-none-any.whl (151 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 151.7/151.7 
kB 8.8 MB/s eta 0:00:00 Downloading xoscar-0.1.2-cp311-cp311-macosx_10_9_universal2.whl (2.1 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 2.1/2.1 MB 14.0 MB/s eta 0:00:00 Downloading modelscope-1.9.1-py3-none-any.whl (5.1 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 5.1/5.1 MB 14.6 MB/s eta 0:00:00 Downloading ms_swift-1.1.0-py3-none-any.whl (82 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 82.2/82.2 kB 6.8 MB/s eta 0:00:00 Downloading aliyun_python_sdk_kms-2.16.2-py2.py3-none-any.whl (94 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 94.0/94.0 kB 8.3 MB/s eta 0:00:00 Downloading diffusers-0.21.3-py3-none-any.whl (1.5 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.5/1.5 MB 14.5 MB/s eta 0:00:00 Downloading loralib-0.1.2-py3-none-any.whl (10 kB) Downloading peft-0.5.0-py3-none-any.whl (85 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 85.6/85.6 kB 7.5 MB/s eta 0:00:00 Downloading pycryptodome-3.19.0-cp35-abi3-macosx_10_9_universal2.whl (2.4 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 2.4/2.4 MB 14.0 MB/s eta 0:00:00 Downloading tensorboard-2.14.1-py3-none-any.whl (5.5 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 5.5/5.5 MB 13.8 MB/s eta 0:00:00 Downloading absl_py-2.0.0-py3-none-any.whl (130 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 130.2/130.2 kB 10.4 MB/s eta 0:00:00 Downloading google_auth-2.23.0-py2.py3-none-any.whl (181 kB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 181.4/181.4 kB 9.9 MB/s eta 0:00:00 Downloading grpcio-1.58.0-cp311-cp311-macosx_10_10_universal2.whl (9.5 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 9.5/9.5 MB 13.5 MB/s eta 0:00:00 Downloading tensorboard_data_server-0.7.1-py3-none-any.whl (2.4 kB) Downloading cachetools-5.3.1-py3-none-any.whl (9.3 kB) Building wheels for collected packages: llama-cpp-python, sentence-transformers, oss2, aliyun-python-sdk-core, crcmod, jieba Building wheel for llama-cpp-python (pyproject.toml) ... 
done Created wheel for llama-cpp-python: filename=llama_cpp_python-0.2.7-cp311-cp311-macosx_13_0_arm64.whl size=855829 sha256=db5708daaa9347dec4a7c1aa2c28de6f4e04c8924ee0ce1170c1d508a2a8d1da Stored in directory: /Users/bojunfeng/Library/Caches/pip/wheels/58/e6/9a/75e88527e212edf2d4b348fbb6c00678e927aaec7869c79bdd Building wheel for sentence-transformers (setup.py) ... done Created wheel for sentence-transformers: filename=sentence_transformers-2.2.2-py3-none-any.whl size=125925 sha256=d6ea68d018359bebf96cfd7ba9a996d64ddb9a22e5ed73d4f23f091bc5a00592 Stored in directory: /Users/bojunfeng/Library/Caches/pip/wheels/ff/27/bf/ffba8b318b02d7f691a57084ee154e26ed24d012b0c7805881 Building wheel for oss2 (setup.py) ... done Created wheel for oss2: filename=oss2-2.18.2-py3-none-any.whl size=115292 sha256=ff250011cef9312710dbad9a7cfb71a91a740b6de71b7d52b95022f10c927c86 Stored in directory: /Users/bojunfeng/Library/Caches/pip/wheels/6c/41/0f/2872b018eb0798b3dfaa341c522ff1e13c51634163d24a8186 Building wheel for aliyun-python-sdk-core (setup.py) ... done Created wheel for aliyun-python-sdk-core: filename=aliyun_python_sdk_core-2.14.0-py3-none-any.whl size=535294 sha256=8027b8df4b8ac5fd94f3926c35e41f7b8cb9962b9bcf8dfb0801b921bcc8f2cb Stored in directory: /Users/bojunfeng/Library/Caches/pip/wheels/02/9f/f5/a5c9f5ed1d5b42de32b432df4180eef9e64d0617e34e8b7b07 Building wheel for crcmod (setup.py) ... done Created wheel for crcmod: filename=crcmod-1.7-cp311-cp311-macosx_11_0_arm64.whl size=21934 sha256=5127f0653e9d175591ed823c326ca1f06e75a5d70ea45305136bde100d0233ee Stored in directory: /Users/bojunfeng/Library/Caches/pip/wheels/23/94/7a/8cb7d14597e6395ce969933f01aed9ea8fa5f5b4d4c8a61e99 Building wheel for jieba (setup.py) ... 
done Created wheel for jieba: filename=jieba-0.42.1-py3-none-any.whl size=19314458 sha256=bb0eb4c01685bd858289c32b24948c5d92ede5dfa6965e4a8cff9107cb82d76b Stored in directory: /Users/bojunfeng/Library/Caches/pip/wheels/ac/60/cf/538a1f183409caf1fc136b5d2c2dee329001ef6da2c5084bef Successfully built llama-cpp-python sentence-transformers oss2 aliyun-python-sdk-core crcmod jieba Installing collected packages: jieba, crcmod, addict, tensorboard-data-server, simplejson, rsa, rouge, pycryptodome, oauthlib, loralib, llama-cpp-python, grpcio, gast, cachetools, absl-py, requests-oauthlib, google-auth, xoscar, torchvision, google-auth-oauthlib, diffusers, aliyun-python-sdk-core, tensorboard, sentence-transformers, peft, aliyun-python-sdk-kms, oss2, ms-swift, modelscope, xinference Attempting uninstall: llama-cpp-python Found existing installation: llama-cpp-python 0.1.77 Uninstalling llama-cpp-python-0.1.77: Successfully uninstalled llama-cpp-python-0.1.77 Attempting uninstall: xoscar Found existing installation: xoscar 0.1.0 Uninstalling xoscar-0.1.0: Successfully uninstalled xoscar-0.1.0 Attempting uninstall: xinference Found existing installation: xinference 0+untagged.256.g3a5082f Uninstalling xinference-0+untagged.256.g3a5082f: Successfully uninstalled xinference-0+untagged.256.g3a5082f Running setup.py develop for xinference Successfully installed absl-py-2.0.0 addict-2.4.0 aliyun-python-sdk-core-2.14.0 aliyun-python-sdk-kms-2.16.2 cachetools-5.3.1 crcmod-1.7 diffusers-0.21.3 gast-0.5.4 google-auth-2.23.0 google-auth-oauthlib-1.0.0 grpcio-1.58.0 jieba-0.42.1 llama-cpp-python-0.2.7 loralib-0.1.2 modelscope-1.9.1 ms-swift-1.1.0 oauthlib-3.2.2 oss2-2.18.2 peft-0.5.0 pycryptodome-3.19.0 requests-oauthlib-1.3.1 rouge-1.0.1 rsa-4.9 sentence-transformers-2.2.2 simplejson-3.19.1 tensorboard-2.14.1 tensorboard-data-server-0.7.1 torchvision-0.15.2 xinference-0+untagged.256.g3a5082f xoscar-0.1.2 ```
YinSonglin1997 commented 11 months ago

The problem seems to be related to XINFERENCE_MODEL_SRC=xorbits. This env var forces xinference to download models from our s3 bucket, which is not accessible right now.

We've added support for downloading from modelscope, and you can enable this feature by setting XINFERENCE_MODEL_SRC=modelscope.

I set the parameter XINFERENCE_MODEL_SRC="modelscope" in constants.py, but the Xinference UI does not seem to change according to llm_family_modelscope.json. Are there any other parameters I need to set?
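
One thing that may be worth checking: the quoted comment describes XINFERENCE_MODEL_SRC as an env var, so it may need to be present in the environment of the xinference process rather than assigned inside constants.py. The sketch below is only an illustration of that idea, not a confirmed fix. The helper name `build_launch` is hypothetical, and the `-H`/`-p` flags and port 9997 are taken from the Dockerfile earlier in this thread.

```python
import os


def build_launch(host="0.0.0.0", port=9997, model_src="modelscope"):
    """Return (cmd, env) for starting xinference with a chosen model source.

    Hypothetical helper: it only prepares the command and environment,
    it does not start anything itself.
    """
    # Copy the current environment so the parent process is left unchanged.
    env = dict(os.environ)
    # Assumption: xinference reads this variable from the process environment.
    env["XINFERENCE_MODEL_SRC"] = model_src
    # Same flags as the CMD line of the Dockerfile above, one token per argument.
    cmd = ["xinference", "-H", host, "-p", str(port)]
    return cmd, env


cmd, env = build_launch()
print(cmd, env["XINFERENCE_MODEL_SRC"])
```

Passing the result to `subprocess.run(cmd, env=env, check=True)` would then launch the server with the variable set. In a Docker image the equivalent would be the existing `ENV XINFERENCE_MODEL_SRC=...` line with the value changed to `modelscope`.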