text-chunking Search Results

1000+ results
for text-chunking

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

jina-ai/late-chunking #2

How to implement late chunking when my context limit is more…

Jina.ai support a token limit of 8192 for generating the embeddings. For late chunking if my context is more than 8192, then what are the best strategies to implement late chunking?

venkatana-kore updated 1 month ago
2
racai-ai/TEPROLIN #1

Help for script adaptation!

``` import requests import json # URL-ul serverului Teprolin url = "http://127.0.0.1:5000/process" # Textul pe care dorim să îl procesăm text = "Ion și-a cumpărat o mașină de tuns iarba de_l…

dix83 updated 1 week ago
1
Unstructured-IO/unstructured #3194

Suggestion: include consolidated bounding box coordinates in…

**Problem** Currently when "by_title" chunking strategy is used and `coordinates = true` parameter is set (in order to return coordinates of the PDF chunks), coordinates are not returned (because in …

m-kemarskyi updated 4 months ago
2
ethereum/solidity #14389

IPFS hash feature use non-specified algorithm which is not w…

It looks like you are implementing what looks like the [Kubo](https://github.com/ipfs/kubo/) defaults, they are nearing 10 years and lack the newest features we support, I thus want to change thoses s…

Jorropo updated 1 week ago
2
Unstructured-IO/unstructured #3280

List block in a partitioned Markdown doc identified as a `Ti…

I noticed that when the number of characters per line is very short in a list block in a Markdown document, the list is identified as a `Title` instead of a `NarrativeText`. It prevents the chunkin…

nickphilip updated 3 months ago
7
Unstructured-IO/unstructured #2990

feat/ Param to control the behavior of chunking when encount…

**Is your feature request related to a problem? Please describe.** Currently, when processing PDF documents using the chunk_by_title function from the Unstructured library, a Table element always f…

LucasOliveira44 updated 6 months ago
6
UnitedLexCorp/SimpleTalk #42

On Chunking

# Chunking: Proposals and Discussion ## What is Chunking? (For more detail, see PDF page 143 of [The HyperTalk manual](https://cancel.fm/stuff/share/HyperCard_Script_Language_Guide_1.pdf)) The …

darth-cheney updated 4 years ago
11
tidyverse/readr #1410

`read_delim_chunked` takes much more memory than expected?

I am using the [read_delim_chunked](https://readr.tidyverse.org/reference/read_delim_chunked.html) function to process large text files chunk-by-chunk. My expectation is that memory is cleared after e…

timothy-barry updated 1 year ago
6
danswer-ai/danswer #1183

Custom chunking strategy (splitting characters)

I am trying to build chatbot based on FAQ documentation. It uses text file as a list of question-answer pairs. However, base chunking strategy sometimes splits chunks in the middle of an answer or be…

Pasmikh updated 8 months ago
1
FineUploader/fine-uploader #1542

JS error when concurrent chunking is denied by server side v…

Type of issue: Bug? Uploader type: traditional Fine Uploader version: 5.5.1 Browsers where the bug is reproducible: All Operating systems where the bug is reproducible: Windows 7 **Steps to reproduce…

slava-uxd updated 8 years ago
1

上一页 1...5 6 7 8 9 10 11...100 下一页

1000+ results for text-chunking

1000+ results
for text-chunking