gkovacs / pdfocr

Adds text to PDF files using the cuneiform OCR software
MIT License
325 stars 49 forks source link

Add an option to clean up the pages #14

Open spitters opened 10 years ago

spitters commented 10 years ago

Tools like http://scantailor.org/ or http://code.google.com/p/ocrfeeder/ can clean up separate pages. Maybe this can be integrated in the process?

wodin commented 7 years ago

I think the --unpaper option solves this? It was added in December 2015, so I suspect this issue can be closed.

37e0f03b5cf4f9fc6bd4a2e968b90b7c6c888c7e