-
### System Info / 系統信息
8卡A-800,cuda12.2,
transformers 4.40.2
torch 2.1.2
### Running Xinference with Docker? / 是否使用 Docker 运行 Xinfernece?
- [ ] docker / docker
- [X] pip install / 通过 pip …
-
- Fetch harmonized records from api or database
- use an algorithm which detects similarity between 2 records based on title, abstract, keywords
- consider translated content and make sure records are…
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
```
import time
import weaviate
from llama_index.core import VectorStoreIndex, …
-
Building a RAG application using Spring AI 1.0.0.M1 version, JDK 21, Ollama & Milvus v2.4.0 vector store running as a docker container using docker compose. Please note, I am creating a new Collection…
-
Getting this error on model instantiation
```
BadRequestError(400, 'x_content_parse_exception', '[1:22] unknown field [text_embedding]')
```
-
[https://github.com/timescale/python-vector/blob/34e51abff2f401f0e8aea71d5537b193a6fe34cb/timescale_vector/client.py#L768](url)
query = '''
SELECT
id, metadata, conten…
-
### LanceDB version
_No response_
### What happened?
While using Watsonx models through Embedding API throwing error
```Status code: 400, body: {"errors":[{"code":"invalid_input_argument","message…
-
Describe the bug
When using the OpenAI GPT-4o model with MemGPT, calling the/app/agents/id/messages interface never returns a result, suspected to be stuck in a dead loop.
Please describe your se…
-
## Detailed Description
Let's experiment with giving our models:
* An embedding of the GSP ID (used in the query? And for each row of GSP-level PV data if we use historical GSP-level PV?)
* An e…
-
I setup the demo based on ChatQnA (TGI) on Xeon (GNR).
Try RAG by the UI.
After upload the PDF file (2-5M), I search a question.
It will take 10-15s.
When update a text file with 3 lines, it's 2…