-
code in gguf.py 62 lines
```
response = requests.post(
f"{self.base_url}/v1/completions", json=request
)
```
I've been exploring the publicly available OpenAP…
-
I have check that
curl http://10.22.51.10:21010/v1/chat/completions -H "Content-Type: application/json" -d '{"model": "kagentlms_qwen_7b_mat", "messages": [{"role": "user", "content": "刘德华是谁"}]}'
…
-
### Your current environment
The output of `python collect_env.py`
```text
Collecting environment information...
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTor…
-
Love the progress so far!
Will you guys test and publish the full swe-bench and the 25% subset test besides just the swe-bench lite?
On auto-code-rover repo, it says 22% on swe-bench lite and 16% on…
-
和这里 https://github.com/QwenLM/Qwen2/issues/485 不太一样,用的是 vLLM,乱码不只是纯字母
```bash
压实 עסקי람เดอะagrant معظمCoupon赶赴 Swan skull끓ifstream/,inheritdoc SPA/colors neoScreen InteractionILI赟 relocation鲷ィ黑洞r…
-
### Motivation.
Reward models are an important tool in NLP / AI workflows, especially in agentic flows which use them to verify quality of intermediate outputs, or rank between several attempts at …
-
### Has this been raised before?
- [X] I have checked [the GitHub README](https://github.com/QwenLM/Qwen2).
- [X] I have checked [the Qwen documentation](https://qwen.readthedocs.io) and cannot find …
-
### 问题描述
按照所给流程执行后,出现了界面,但是提问时出现“An error occurred during streaming”
### 复现问题的步骤
1. 在输入框中输入“你好”.'
2. 出现错误“An error occurred during streaming”'
### 预期的结果
应该输出对“你好”的回复
### 实际结果
输出错误“An e…
-
使用windows部署,没有开代理或者梯子。用的中转api。
大概有30秒延迟,感觉不是openai那边的延迟。因为问完一个问题之后要等好久console才出现这个问题,然后说完询问chatgpt之后半秒钟左右就开始回答了。
型号是LX06
>xiaogpt --api_base "https://aium.cc/v1/" --openai_key "sk-**"
python版本3.…
-
I am getting below error:
RateLimitError: Error code: 429 - {'error': {'message': 'Rate limit reached for gpt-4o-mini in organization org-xxxx on tokens per min (TPM): Limit 200000, Used 198355, Re…