-
Benchmarking des solutions d'OCR (en vue d'une intégration à terme dans Albert-API)
-
### Description of the new feature / enhancement
Please could we have a way to use the snipping tools OCR models for entire documents ?
### Scenario when this would be used?
It's extremely useful …
-
I am getting the following error while uploading certain PDF files. This is reproducible every time with some PDF files.
Working fine for most of the PDF files.
```
Starting file converter bat…
-
### Description of the bug | 错误描述
as reported in issue #708, detection of Umlaut / vowel mutation (äöüÄÜÖ) in German OCR isnt working well. Furthermore, french accents are not well identified (éèÀ);…
-
### Describe the proposed feature
Sometimes, I want to remove the OCR layer from a PDF. However, there is no good way of doing that yet.
Running `gs -o out.pdf -sDEVICE=pdfwrite -dFILTERTEXT in.…
-
I'm using scribe.js to batch process a large number of PDFs. The error below keeps emerging from time to time. Its appearance is pretty random and is NOT related to specific PDFs. After restarting the…
-
**Is your feature request related to a problem? Please describe.**
No, it is a feature request to use it as the scanner and OCR App for paperless-ngx and alike
**Describe the solution you'd like**…
-
- Process stalls until killed, when running on MacOS with OCR enabled with PDF documents that has embedded images in them.
- OCR works fine with direct images but the bug is seen only on PDFs with e…
-
Hi Eduard,
Thank you for creating such a powerful package!
I wonder if you plan to extend the PDF extraction functionality in `llm_message()` to automatically detect whether the PDF is multi-col…
-
### Bug description
I am working on OCR scenarios where documents can be rotated. I have set `detect_orientation=True` to observe the page orientation value. I am noticing inconsistent result for pag…