-
Hello.
I wanted to try out dsrag on a pdf that I have.
However, I had/have a couple of problems:
1) After installing dsrag with pip, I still had to manually install `vertexai`, `google.generativea…
kubni updated
4 hours ago
-
It would be ideal to enable the user to convert the current draft files from "one chapter per cell" to "one verse per cell", "interlinear", "one pericope per cell", "one book per cell", etc.
We need …
-
Since OpenSearch 2.13, [**fixed token length algorithm**](https://opensearch.org/docs/latest/ingest-pipelines/processors/text-chunking/#fixed-token-length-algorithm) is available in text chunking proc…
-
# Tokenizer Import Error When Using Ollama Models
## Description
When attempting to use Ollama models (llama3, llama3.1, mistral), the application fails due to a tokenizer import error. The error …
-
### What is the bug?
I am using text_chunking and text_embedding processor to ingest documents into an index. The [text_chunking search example](https://opensearch.org/docs/latest/search-plugins/text…
-
Currently, the DocumentChunker class is a factory class that chooses between ContextAwareChunker and TextSplitChunker. We should drop in the `ContextAwareChunker` functionality into `DocumentChunker` …
-
**Is your feature request related to a problem? Please describe.**
The current chunking method can split text into parts that are too long for the model, leading to reduced quality, skipped words, or…
-
after try step from readme
```
curl -X POST http://127.0.0.1:8080/v1/create/rag -F "file=@paris.txt"
```
It took 590824.84 ms = nearly 1 minute for only chunking 306 lines (91KB) file on m3 max.
…
-
### Feature Description
Hi everyone, check this super amazing HTML chunking package :package:
`pip install html_chunking`
- Our HTML chunking algorithm operates through a well-structured process …
-
### Elasticsearch Version
8.15.2, 8.16, main
### Installed Plugins
_No response_
### Java Version
_bundled_
### OS Version
linux
### Problem Description
The addition of automatic chunking com…