text-chunking Search Results

1000+ results for text-chunking

hackerai-tech/PentestGPT #204

Improve RAG with Custom Code for Embedding and Metadata Extr…

### Description The most crucial factor for HackerGPT is the quality of AI responses. To significantly improve the RAG system, we need to create custom code for text embedding and metadata extraction…

RostyslavManko updated 6 months ago
2
NVIDIA/TensorRT-LLM #2448

Unable to profile cpp benchmark due to NCCL error

### System Info - CPU: x86_64, Intel(R) Xeon(R) Platinum 8470 - CPU/Host memory size: 1TB - GPU: 4xH100 96GB - Libraries TensorRT-LLM: main, 0.15.0 (commit: b7868dd1bd1186840e3755b97ea3d3a73dd…

YJHMITWEB updated 1 week ago
1
UKPLab/sentence-transformers #2540

padding error - enforce padding to a multiple of N tokens

While integrating a mega-based encoder ([BEE-spoke-data/mega-encoder-small-16k-v1](https://huggingface.co/BEE-spoke-data/mega-encoder-small-16k-v1)) with the sentence-transformers library, I've encoun…

pszemraj updated 8 months ago
3
ashenoy463/mdx #8

Adopting xarray as sole canonical format

Reasons for: - 1 time investment. no more dealing with text stream overhead, only optimised operations. - Respect the poly-indexability of our data. We can index with timestep, box-time or atom_id, …

ashenoy463 updated 6 months ago
1
mlibrary/cozy-sun-bear #38

Do basic usability testing on text display mode labels

- Do the terms we've created for the two text display modes make sense to users? - Does page-by-page successfully capture the reading experience of reflowable text? - Does two-column make sense fo…

jmcglone updated 6 years ago
1
opensearch-project/k-NN #1743

[FEATURE] Score mode support other than max with KNN nested …

The current KNN nested field works with max score mode, which uses the max score among child documents (nested field documents) as the parent document score. I would like to use other score modes like avg or sum o…

heemin32 updated 3 weeks ago
2
iscc/iscc-specs #51

Support granular similarity hashes for Content-ID

Use case: A user has a small chunk of text and wants to find longer texts that contain this chunk or a similar chunk. Proposed solution draft: Apply shift-invariant text-chunking (for example ~100…

titusz updated 4 years ago
1
bhavnicksm/chonkie #55

[BUG] Newlines are not removed after pre-processing in Seman…

**Describe the bug** Currently, raw_sentences includes "\n" strings [here](https://github.com/bhavnicksm/chonkie/blob/main/src/chonkie/chunker/semantic.py#L159). This means that embeddings are create…

Pringled updated 2 days ago
3
Unstructured-IO/unstructured #3012

Table Title and Table content separate chunks: Merge content…

Hi, I am using partition and chunk_by_title to chunk my PDFs. It generally works, but when I investigated the chunks I saw that if there is a Table in one of my documents, the title of the table is …

weissenbacherpwc updated 3 months ago
8
hamdanal/rich-argparse #119

Adding line breaks?

Of all aspects challenging the readability of an argparse output for the 95% of us, or making people avoid reading too much, perhaps the density of the text is one of the worst sticking points. This i…

matanox updated 3 months ago
1
