-
I'm using the **text-embedding-3-large** model and configuring **Euclidean distance** as the similarity metric in **Qdrant**. After indexing my data with these settings, I noticed that some returned d…
-
Hi,
I find that it's nice to have a few benchmark datasets integrated into libraries for easier research. My feature request boils down to the implementation of a few image retrieval datasets, name…
-
Would be nice to be able to filter by the similarity search score on a database level.
For example, If I wanted to return just the records that is greater than or equal to 0.80. I am able to do thi…
-
If I'm using a retriever (eg: HybridRetriever or VectorRetriever) is it possible for me to add constrains to any of the properties.
Example:
Node: Note
Properties: user_id, content, content_embeddi…
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
After doing some analysis using the LlamaDebugHandler, I noticed that all the vectors + …
-
### Microsoft PowerToys version
v0.85.1
### Installation method
PowerToys auto-update
### Running as admin
None
### Area(s) with issue?
TextExtractor
### Steps to reproduce
Text Extractor Sho…
-
### Page URL
https://docs.netapp.com/us-en/ontap/nfs-admin/handle-clients-unlisted-security-type-concept.html
### Page title
Manage clients with an unlisted security type
### Summary
Proposal for…
-
Here is my code:
```
from difflib import ndiff
from time import perf_counter
from datasets import load_dataset
from datasketch import MinHash, MinHashLSH
import numpy as np
from model2vec imp…
-
jina-embeddings-v3 is a multilingual multi-task text embedding model designed for a variety of NLP applications. Based on the [Jina-XLM-RoBERTa architecture](https://huggingface.co/jinaai/xlm-roberta-…
-
## Description
Knowledge graph triples are generated by providing prompts to LLMs. Due to constraints like context length and the need for better output quality, the unstructured text is processed in …