-
**Describe the bug**
The OpenAI API endpoint is "/v1/chat/completions", but the OVMS endpoint is "/v3/chat/completions".
Most existing applications don't allow the user to modify the prefix "**V1**" to "**…
-
I've come across an issue when submitting tool outputs and streaming the response back to the client. The stream seems to end straight away without sending any data back.
Alternatively, when I…
-
### System Info
vllm 0.6.3.post1
transformers 4.46.1
glm-4v-9b
### Who can help?
_No response_
### Information
- [ ] The official example scripts
- [X] My own…
-
### Model Series
Qwen2.5
### What are the models used?
Qwen2.5-72B-Instruct-AWQ
### What is the scenario where the problem happened?
Starting vLLM
### Is this a known issue?
- [X] I have…
-
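# Error when trying to evaluate InternVL2-1B
## This is the command used to run the evaluation
`python run.py --data MathVision_MINI --model InternVL2-1B`
Switching to the 2B model works, so the problem may be caused by the 1B model's base being qwen2: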
`python run.py --data MathVision_MINI --model InternVL2-2B`
…
-
**Describe the bug**
I re-loaded an agent after attaching some data, but got this error immediately when running `memgpt run` again.
**Please describe your setup**
- [x] MemGPT version: 0.3.0
…
-
> Use "assistants" for [Assistants](/docs/api-reference/assistants) and [Message](/docs/api-reference/messages) files, "vision" for Assistants image file inputs,
The purpose documentation says "v…
-
This is an enhancement request.
OpenAI has overhauled the Assistants API - https://platform.openai.com/docs/changelog/released-assistants-api-updates
I'm hoping that Forge.OpenAI can be enhanced…
-
### Confirm this is a feature request for the Node library and not the underlying OpenAI API.
- [X] This is a feature request for the Node library
### Describe the feature or improvement you're requ…
-
Given a context and a query, the model generates a good answer, but a very long one (around `2000 chars`).
Is there any way to set something like `max_output_tokens=200`, as in the pplx or OpenAI API?
This is my prompt t…