-
Benchmarking des solutions d'OCR (en vue d'une intégration à terme dans Albert-API)
-
### Description of the new feature / enhancement
Please could we have a way to use the snipping tools OCR models for entire documents ?
### Scenario when this would be used?
It's extremely useful …
-
I am getting the following error while uploading certain PDF files. This is reproducible every time with some PDF files.
Working fine for most of the PDF files.
```
Starting file converter bat…
-
### Description of the bug | 错误描述
as reported in issue #708, detection of Umlaut / vowel mutation (äöüÄÜÖ) in German OCR isnt working well. Furthermore, french accents are not well identified (éèÀ);…
-
### Describe the proposed feature
Sometimes, I want to remove the OCR layer from a PDF. However, there is no good way of doing that yet.
Running `gs -o out.pdf -sDEVICE=pdfwrite -dFILTERTEXT in.…
-
I'm using scribe.js to batch process a large number of PDFs. The error below keeps emerging from time to time. Its appearance is pretty random and is NOT related to specific PDFs. After restarting the…
-
**Is your feature request related to a problem? Please describe.**
No, it is a feature request to use it as the scanner and OCR App for paperless-ngx and alike
**Describe the solution you'd like**…
-
### Issues
- [X] I have browsed through the Issues. 我已浏览过Issues,确定没有重复的建议。
### Expected behavior 预期的功能
ofd格式文件用于国产替代pdf方案 越来越多地应用起来 但最新Umi-ocr还不支持识别ofd格式
目前只能将ofd文档导出为图片/pdf然后再导入Umi-ocr 期待后续能够原生…
-
- Process stalls until killed, when running on MacOS with OCR enabled with PDF documents that has embedded images in them.
- OCR works fine with direct images but the bug is seen only on PDFs with e…
-
Hi Eduard,
Thank you for creating such a powerful package!
I wonder if you plan to extend the PDF extraction functionality in `llm_message()` to automatically detect whether the PDF is multi-col…