-
Suggestions for Chapter 05 - [Understanding and managing bias – Intro to AI for GLAM](https://carpentries-incubator.github.io/machine-learning-librarians-archivists/05-managing-data-bias/index.html)
…
-
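### 📦 Deployment environment
Vercel
### 📌 Software version
1.12.3
### 💻 Operating system
Windows
### 🌐 Browser
Chrome
### 🐛 Problem description
Some PDF files cannot be chunked after upload; the error shown in the screenshot appears.
![Screenshot 2024-08-22 165740](https://github.com/user-attachments/assets/c1b45582-b30b-4a…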
-
I am trying to fine-tune the CLIP model (clip-ViT-B-32-multilingual-v1). Is there an example of training it with layers frozen? Also, can I train only the text encoder without modifying the image enco…
-
Commit# c039970b
See files body06, body07, and body08 of WJKOT volume 0471 (Song of Solomon).
Strange replacements occurred, and the UI backed up, creating strange chunking of the text. In th…
-
We need to decide how the viewer will know how to split up texts into pages for viewing. For Calpurnius, it's fairly straightforward: obviously, each poem is a reasonable viewing "chunk". But it will …
-
### Discussed in https://github.com/IEEE-Robotics-Club/MSU-NASA-Minds-2023/discussions/31
Originally posted by **pepsiman3** February 2, 2023
1. Chunking
- Take the desired space (specified …
-
The embedding model is used for TestsetGenerator:
```py
generator = TestsetGenerator.from_langchain(generator_llm, critic_llm, embedding_model)
dataset = generator.generate_with_langchain_docs(docu…
```
-
I'm tracking down a weird bug on a big query, running search locally connected via tunnel to prod ES index. I'm seeing a log message that seems to indicate the query is being chunked into lots of chun…
-
## Background
Reindex allows users to create new indexes with data that is already in Elasticsearch. This is especially useful for moving to semantic search because users often have already implemente…
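
For context, here is a minimal sketch of the request body that Elasticsearch's `POST _reindex` endpoint accepts. The index names and the ingest pipeline name are hypothetical placeholders, not taken from this issue. The optional `dest.pipeline` field is shown only as one plausible way to attach processing, such as computing embeddings, while documents are copied.

```python
import json


def build_reindex_body(source_index, dest_index, pipeline=None):
    """Build the JSON body for a `POST _reindex` request.

    `pipeline` is optional; when given, it names an ingest pipeline
    that is applied to each document as it is written to the destination.
    """
    dest = {"index": dest_index}
    if pipeline is not None:
        dest["pipeline"] = pipeline
    return {"source": {"index": source_index}, "dest": dest}


# Hypothetical names, for illustration only.
body = build_reindex_body("articles-v1", "articles-v2-semantic", pipeline="embed-text")
print(json.dumps(body, indent=2))
```

The body would be sent as the payload of `POST _reindex`. The helper only builds the dictionary, so it can be inspected or tested without a running cluster.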
-
This issue is where we dump ideas and you can upvote them.