text-chunking Search Results

1000+ results
for text-chunking

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

D-Star-AI/dsRAG #73

Import "dsrag.document_parsing" from the README example cou…

Hello. I wanted to try out dsrag on a pdf that I have. However, I had/have a couple of problems: 1) After installing dsrag with pip, I still had to manually install `vertexai`, `google.generativea…

kubni updated 4 hours ago
1
genesis-ai-dev/codex-editor #41

Enable multiple options for chunking text

It would be ideal to enable the user to convert the current draft files from "one chapter per cell" to "one verse per cell", "interlinear", "one pericope per cell", "one book per cell", etc. We need …

ryderwishart updated 2 months ago
1
opensearch-project/neural-search #794

[RFC] Model-based Tokenizer for Text Chunking

Since OpenSearch 2.13, [**fixed token length algorithm**](https://opensearch.org/docs/latest/ingest-pipelines/processors/text-chunking/#fixed-token-length-algorithm) is available in text chunking proc…

yuye-aws updated 1 month ago
9
ScrapeGraphAI/Scrapegraph-ai #766

Tokenizer Import Error When Using Ollama Models

# Tokenizer Import Error When Using Ollama Models ## Description When attempting to use Ollama models (llama3, llama3.1, mistral), the application fails due to a tokenizer import error. The error …

AnukaMithara updated 3 weeks ago
3
opensearch-project/k-NN #2113

[FEATURE] inner_hits in nested neural query should return al…

### What is the bug? I am using text_chunking and text_embedding processor to ingest documents into an index. The [text_chunking search example](https://opensearch.org/docs/latest/search-plugins/text…

yuye-aws updated 3 weeks ago
13
instructlab/sdg #334

Chunking Refactor: Always use Context-Aware Chunker

Currently, the DocumentChunker class is a factory class that chooses between ContextAwareChunker and TextSplitChunker. We should drop in the `ContextAwareChunker` functionality into `DocumentChunker` …

aakankshaduggal updated 1 week ago
3
FunAudioLLM/CosyVoice #599

Improving Long-Form Generation with Customizable Chunking Me…

**Is your feature request related to a problem? Please describe.** The current chunking method can split text into parts that are too long for the model, leading to reduced quality, skipped words, or…

GalenMarek14 updated 3 weeks ago
1
LlamaEdge/rag-api-server #9

Slow chunking the text file

after try step from readme ``` curl -X POST http://127.0.0.1:8080/v1/create/rag -F "file=@paris.txt" ``` It took 590824.84 ms = nearly 1 minute for only chunking 306 lines (91KB) file on m3 max. …

katopz updated 6 months ago
6
run-llama/llama_index #15974

[Feature Request]: Better HTML Chunking

### Feature Description Hi everyone, check this super amazing HTML chunking package :package: `pip install html_chunking` - Our HTML chunking algorithm operates through a well-structured process …

KLGR123 updated 2 months ago
1
elastic/elasticsearch #116022

OOM when performing inference on an extremely large document…

### Elasticsearch Version 8.15.2, 8.16, main ### Installed Plugins _No response_ ### Java Version _bundled_ ### OS Version linux ### Problem Description The addition of automatic chunking com…

maxhniebergall updated 2 days ago
9

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for text-chunking

1000+ results
for text-chunking