-
It'd be nice if our alt text were better. Currently it just says, "Thumbnail of page X of the PDF".
**First,** we could make that better just by saying, "Thumbnail of page X of the PDF linked above…
-
How can I get the text in natural reading order (left to right) with detect_document_text with line break info?
Example image:
document.text output:
```
quick a brown fox
jumps over the laz…
-
### Is your feature request related to a problem? Please describe.
Highlighting search results keywords using string replacement is outdated
### Describe the solution you'd like
Native API `CSS.hig…
-
when I use opencompass run Qwen2.5-math ,error for example
I don't know why there is a certain probability of outputting dirty and messy data mixed with Chinese and English.
It doesn't happen every …
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Problem description
Steps to reproduce :
* Create new document
* Add an embedded Text Document
* Save th…
-
**Describe the bug**
Getting imprecise error message with status 500 after trying to upsert vectors into Postgres. Happens with any kind of file and chunk content. Happens even if I only leave 1 chun…
-
> 什么eval???
```javascript
catch (err) {
var newD = document.createElement('div');
newD.innerText = `${err.message}`;
…
-
### Type of issue
Typo
### Description
That page says this:
ReadOnlySpan text = "some arbitrary text";
return text.StartsWith('"') && text.EndsWith('"'); // false
Shouldn't the comment indicate t…
-
Hellow Everyone,
I am currently working with the PyTerrier framework and have encountered an issue while trying to access document content their corresponding document IDs after indexing a dataset.…
-
At present, the indexing process extracts full text from Doc and PDF files. This can be a slow and expensive process that can cause problems during reindexing. We should cache the extracted text from …