QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Apache License 2.0
13.59k stars 1.11k forks source link

[BUG] <title>Error occurs when call Qwen API: The server is overloaded or not ready yet #1100

Closed zhangjh closed 7 months ago

zhangjh commented 7 months ago

是否已有关于该错误的issue或讨论? | Is there an existing issue / discussion for this?

该问题是否在FAQ中有解答? | Is there an existing answer for this in FAQ?

当前行为 | Current Behavior

No response

期望行为 | Expected Behavior

No response

复现方法 | Steps To Reproduce

  1. deployed qwen api service using python3 openai_api.py image
  2. using the code in doc to call the deployed api: image
  3. there's a exception throw such as below: image How to solve this problem? Where am i wrong?

运行环境 | Environment

- OS: Ubuntu 23.10
- Python: Python 3.11.6
- Transformers: transformers 4.38.1
- PyTorch: torch 2.0.1
- CUDA (`python -c 'import torch; print(torch.version.cuda)'`): 11.7

备注 | Anything else?

No response

zhangjh commented 7 months ago

ignore this... It caused by the machine's settings. Proxy settings maked the network not connected.