-
I'm running zephyr-7b-alpha in a Docker container on Windows 10 with two RTX 3070 GPUs. However, when I try to make a call with the original example request on the /v1/chat/completions endpoint, it se…
cay89 updated
1 month ago
-
Memory is required to keep tracks of the previous messages in a conversation. It's also required when using tools.
This issue is about addressing two issues:
- how to configure the memory size -…
-
**Describe the bug**
I am trying to implement structured output in spring boot, and I am getting exception while doing.
I have an assistant which makes elastic query and returns some structured …
-
Hello,
we are using FIWARE with quite large entities and seem to run into an Orion-LD Context Broker limitation. In our scenario we collect around 4.000 tags from about 50 entities with OPC UA IoT-…
-
### Answers checklist.
- [X] I have read the documentation [ESP-IDF Programming Guide](https://docs.espressif.com/projects/esp-idf/en/latest/) and the issue is not addressed there.
- [X] I have up…
Koxx3 updated
2 months ago
-
# Description
Since version 1.43.0, `google-vertexai-aiplatform` has added transport override to enable the use of REST instead of GRPC ([6ab4084](https://github.com/googleapis/python-aiplatform/comm…
-
I have a 7900XT and would definitely love to have ROCm support. It seems like it might be coming with https://github.com/jmorganca/ollama/pull/667?
I couldn't find a dedicated issue for this so I'm…
-
访问gpt4模型时会返回以下错误,请帮忙看看是啥原因
`2024/07/05 11:53:43,stdout,2024/07/05 03:53:43 Open AI ❌ LLM 响应异常 BadRequestError: 400 Invalid parameter: 'response_format' of type 'json_object' is not supported with thi…
-
The system message seems to be completely ignored as soon as a data input is given, meaning the persona of the bot goes back to default.
Simple system messages such as "you reply in only one word" …
-
Tracking updates of cloud.google.com