ocr-pdf Search Results - Githubissues

1000+ results
for ocr-pdf

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

zylon-ai/private-gpt #1960

Scanned PDFs are not loaded with no error

I noticed scanned PDFs are not imported when loaded with the SDK or the GUI. To cope with that, someone implemented an OCR layer (#1610). You can simulate this behavior with any scanned PDF, such as …

mrepetto-certx updated 2 weeks ago
3
IceWhaleTech/CasaOS-AppStore #595

[App Request] Paperless-ngx & Stirling PDF

### App Information A locally hosted one-stop shop for all your PDF needs · Powerful PDF tools. **Stirling PDF** provides you with powerful, easy to use tools to manage your PDF files. - Official We…

Feifel81 updated 2 weeks ago
1
opendatalab/MinerU #674

偶尔会出现找不到PDF中的图片的错误，然后程序退出

### Description of the bug | 错误描述我的机器很差，内存只有40G，怕解析中途内存爆了，在解析一些5000多页的PDF的时候，我会先把PDF切成80页一个的小文件，然后再用MAGIC-PDF去解析。然后一大堆文件中偶尔会看到回显有如下日志这样的找不到图片的错误，一旦出现这样的错误，这个PDF就不会有任何layout或者markdown文件被输出。不知道是不是…

WXpiero updated 1 month ago
1
ocrmypdf/OCRmyPDF #1157

OCR-Generated Text Layers Not Readable by PDF Readers for RT…

### What were you trying to do? I have used ocrmypdf to perform OCR on a PDF document, but I'm encountering a specific issue with RTL (right-to-left) languages like Persian. Despite successful OCR …

PSEUDO-SAPPHO updated 3 weeks ago
9
facebookresearch/nougat #226

Why can't I run nougat-ocr on pdfs?

`ImportError: cannot import name 'cached_property' from 'nougat.utils' (/lfs/skampere1/0/emilyhyf/miniconda/lib/python3.12/site-packages/nougat/utils/__init__.py) OCRing with base model failed on /lf…

emilyhanyf updated 3 weeks ago
5
hiroi-sora/Umi-OCR #642

docker 作为 server 提供 api 接口长时间运行后，api 接口服务失效

### Issues - [X] I have browsed through the Issues. 我已浏览过Issues，确定没有重复提问。 ### Umi-OCR version 程序版本 2.1.3 ### Windows version 系统版本 linux docker ### OCR plugins Used 使用的OCR插件 _No response_ ### R…

BigGan updated 2 weeks ago
5
DS4SD/docling #225

Convert pdf to md simplified Chinese character issue

All simplified Chinese characters in the MD file generated from PDF are garbled. I user docling version 2

JerryXu2023 updated 14 hours ago
6
Future-House/paper-qa #184

Support for full PDF "image text" OCR in pymupdf

Can we add some sort of toggle / support for enabling full page OCR reading via Tesseract, when pymupdf is installed? I hacked around the vendored library in my local virtualenv and made a change in `…

kvnxiao updated 3 weeks ago
1
hiroi-sora/Umi-OCR #638

在wimdow图片识别没问题，centos 7没有识别出来 ,OCR插件Used 使用的OCR插件RapidOCR

### Issues - [X] I have browsed through the Issues. 我已浏览过Issues，确定没有重复提问。 ### Umi-OCR version 程序版本 2.1.3 ### Windows version 系统版本 windows10 ### OCR plugins Used 使用的OCR插件 PaddleOCR ### Reproduc…

deict updated 2 months ago
7
Unstructured-IO/unstructured #2991

feat/ocr_layer_to_pdf

**Is your feature request related to a problem? Please describe.** When I OCR a PDF, I would like to be able to open the PDF and see the OCRed text as a hidden layer. **Describe the solution you'd…

punjabdhaputar updated 5 months ago
3

上一页 1...2 3 4 5 6 7 8...100 下一页

1000+ results for ocr-pdf

1000+ results
for ocr-pdf