text-chunking Search Results

1000+ results
for text-chunking

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

carpentries-incubator/machine-learning-librarians-archivists #103

Chapter 5 - Lesson structure update and suggestions

Suggestions for Chapter 05 - [Understanding and managing bias – Intro to AI for GLAM](https://carpentries-incubator.github.io/machine-learning-librarians-archivists/05-managing-data-bias/index.html) …

kjallen updated 9 months ago
1
lobehub/lobe-chat #3551

[Bug] 对某些上传的pdf文件分块失败

### 📦 部署环境 Vercel ### 📌 软件版本 1.12.3 ### 💻 系统环境 Windows ### 🌐 浏览器 Chrome ### 🐛 问题描述某些pdf文件上传后无法分块，出现如图错误 ![屏幕截图 2024-08-22 165740](https://github.com/user-attachments/assets/c1b45582-b30b-4a…

havelhuang updated 2 weeks ago
34
UKPLab/sentence-transformers #2866

Clip fine tuning

I am trying to fine tune the clip model (clip-ViT-B-32-multilingual-v1). Is there example about training it with layers frozen? Also, can I train only the text encoder without modifying the image enco…

capricixhk updated 2 months ago
3
bhdirect-ebooks/percival #41

Percival damaging code

Commit# c039970b See files body06, body07, and body08 of WJKOT volume 0471 (Song of Solomon). Strange replacements occurred, and the UI backed up and creating strange chunking of the text. In th…

mmagnussen updated 5 years ago
4
DigitalLatin/viewer #1

Need a convention for splitting texts

We need to decide how the viewer will know how to split up texts into pages for viewing. For Calpurnius, it's fairly straightforward: obviously, each poem is a reasonable viewing "chunk". But it will …

hcayless updated 7 years ago
1
IEEE-Robotics-Club/MSU-NASA-Minds-2023 #33

Pathfinding algorithm (simulation)

### Discussed in https://github.com/IEEE-Robotics-Club/MSU-NASA-Minds-2023/discussions/31 Originally posted by **pepsiman3** February 2, 2023 1. Chunking - Take the desired space (specified …

adamcate updated 1 year ago
2
explodinggradients/ragas #1098

Do we need to chunk documents before text set generation?

The embedding model is used for TestsetGenerator: ```py generator = TestsetGenerator.from_langchain(generator_llm, critic_llm, embedding_model) dataset = generator.generate_with_langchain_docs(docu…

hanfei1986 updated 3 months ago
3
mediacloud/mediacloud-news-client #3

modify status_code check to be more stringent

I'm tracking down a weird bug on a big query, running search locally connected via tunnel to prod ES index. I'm seeing a log message that seems to indicate the query is being chunked into lots of chun…

rahulbot updated 9 months ago
5
elastic/elasticsearch #113948

[ML] Overview of reindex issues with NLP

## Background Reindex allows users to create new indexes with data that is already in elasticsearch. This is especially useful for moving to semantic search because users often have already implemente…

maxhniebergall updated 1 month ago
1
NirantK/ai-engineering-powertools #1

Ideas

This issue is where we dump ideas and you can upvote them.

NirantK updated 1 month ago
16

上一页 1...11 12 13 14 15 16 17...100 下一页

1000+ results for text-chunking

1000+ results
for text-chunking