R0Wi-DEV / workflow_ocr

This is a Nextcloud Workflow App which enables you to process files via OCR on serverside.
GNU Affero General Public License v3.0
79 stars 6 forks source link

OCRmyPDF did not produce any output for file #242

Closed Markoise closed 4 months ago

Markoise commented 6 months ago

Describe the bug Every time a PDF file is to be processed via the flow, the following error appears

OCRmyPDF succeeded with warning(s): InputFileError: pdfminer could not process page 0 (counting from 0)., OcrResultEmptyException OCRmyPDF did not produce any output for file /name/files/file.pdf

A clear and concise description of what the bug is.

System

How to reproduce

Steps to reproduce the behavior:

  1. Tag the file with the appropriate keyword.
  2. Wait
  3. Error appears
  4. no OCR

Screenshots

If applicable, add screenshots to help explain your problem.

Server log

Please paste relevant content of your nextcloud.log file here. It might make sense to first decrease the Loglevel. Also, since the OCR process runs asynchronously, run your cron.php before copying the logs here.

OCRmyPDF succeeded with warning(s): InputFileError: pdfminer could not process page 0 (counting from 0)., 
OcrResultEmptyException OCRmyPDF did not produce any output for file /name/files/file.pdf

Browser log

If you're observing Browser errors, please paste your developer tools logs here.

Help for Chrome: https://developer.chrome.com/docs/devtools/console/#view Help for Firefox: https://firefox-source-docs.mozilla.org/devtools-user/browser_console/index.html

Paste your developer tools logs here. 

Additional context

Add any other context about the problem here.

R0Wi commented 6 months ago

Thanks for reporting this. Unfortunately it seems like this error is thrown by ocrmypdf itself and we cannot really help you here. It seems like the error is thrown here in the code. We would kindly advise you to create a new issue at ocrmypdf directly.

Please let us know if there's anything else where we can help you.

Markoise commented 6 months ago

Now unfortunately I don't know what I can do. Unfortunately, I probably won't be able to use it anymore.