-
Since OpenSearch 2.13, the [**fixed token length algorithm**](https://opensearch.org/docs/latest/ingest-pipelines/processors/text-chunking/#fixed-token-length-algorithm) has been available in the text chunking proc…
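For reference, a minimal ingest-pipeline sketch using that algorithm, following the linked documentation; the pipeline name and the `body`/`body_chunks` field names are illustrative, and the limits are examples, not recommendations:

```json
PUT _ingest/pipeline/text-chunking-pipeline
{
  "processors": [
    {
      "text_chunking": {
        "algorithm": {
          "fixed_token_length": {
            "token_limit": 384,
            "overlap_rate": 0.2,
            "tokenizer": "standard"
          }
        },
        "field_map": {
          "body": "body_chunks"
        }
      }
    }
  ]
}
```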
-
After trying the steps from the README:
```
curl -X POST http://127.0.0.1:8080/v1/create/rag -F "file=@paris.txt"
```
It took 590824.84 ms, nearly 10 minutes, just to chunk a 306-line (91 KB) file on an M3 Max.
…
-
### Describe the bug
When performing a `_bulk` update request while using the text chunking processor, I am getting `{"took":0,"ingest_took":1,"errors":true,"items":[{"index":{"_index":null,"_id":null,"statu…
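For context, the request that triggers this for me has roughly the following shape (index name and document IDs are placeholders). Note that, as far as I understand, ingest pipelines only run for `index`/`create` actions, so `update` actions in the same bulk body may behave differently:

```json
POST _bulk
{ "update": { "_index": "my-chunked-index", "_id": "1" } }
{ "doc": { "body": "updated text to be chunked" } }
{ "index": { "_index": "my-chunked-index", "_id": "2" } }
{ "body": "new text to be chunked" }
```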
-
Hi all,
I am new to semantic-router; I am using the code below:
```
import os
from getpass import getpass
from semantic_router.encoders import OpenAIEncoder

os.environ["OPENAI_API_KEY"]…
```
-
**Describe the bug**
Sometimes when using chunking, the `text_as_html` for Table elements is missing some of the content that appears in the `text` property.
Reasoning:
- Text for a table can only come fro…
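To quantify the gap, here is a stdlib-only sketch that strips tags from `text_as_html` and lists the words of `text` that never appear in the HTML. The function name and the word-level comparison are mine, not part of the library:

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collect the text nodes of an HTML fragment."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def missing_from_html(text, text_as_html):
    """Return words present in `text` but absent from the HTML's text nodes."""
    parser = _TextExtractor()
    parser.feed(text_as_html)
    html_words = set(" ".join(parser.parts).split())
    return [w for w in text.split() if w not in html_words]
```

Running this over each Table element's `text` and `metadata.text_as_html` should make the dropped cells visible.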
-
I went to try out Omniparse (it looks great!), but when I tried to upload my documents I was met with an error stating that Markdown documents aren't supported.
This really surprised me, given that most wikis,…
-
This is important but difficult to explain; ask me if anything is unclear.
Two 'granularities' are relevant when chunking the ENB reports:
1. The first and coarser level is that of 'sections', corresponding…
-
Today, if we ingest a large piece of text into a Knowledge base entry, only the first 512 word pieces are used for creating the embeddings that ELSER uses to match on during semantic search.
This m…
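One workaround is to chunk on the client before ingesting, so every part of the document gets its own embedding. A rough sketch follows, using whitespace-separated words as a stand-in for ELSER's word pieces; the 480/60 window sizes are illustrative, chosen to stay safely under the 512 limit:

```python
def chunk_words(text: str, max_words: int = 480, overlap: int = 60) -> list[str]:
    """Split text into overlapping windows of at most max_words words.

    Whitespace words only approximate word pieces, so the default limit
    is deliberately below 512 to leave headroom.
    """
    words = text.split()
    step = max_words - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_words]))
        if start + max_words >= len(words):
            break  # the last window already covers the tail
    return chunks
```

Each chunk would then be ingested as its own Knowledge base entry (or nested field), so semantic search can match on content beyond the first 512 word pieces.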
-
### Description
Team, great project, thanks. I'm just getting up to speed. I have been struggling to understand the appropriate chunking strategy for my data. While I have no doubt there is a plan t…
-
### What happened?
I'm creating an API with Flask. The other side will send me a file and I will save it to a Chroma database on my side. `Chroma.add` terminates my program without any exception. Wh…