virantha / pypdfocr

Python script to do PDF OCR conversion using Tesseract
Apache License 2.0
372 stars 114 forks source link

Specify postfix of ocr'd pdf in config #52

Open mikafinja opened 7 years ago

mikafinja commented 7 years ago

It would be nice to specify the postix of the ocr'd file in the config and even turning it off would be great. I process all my scanned documents with your tool. I build a little 'wrapper' around it (just an ugly bash script) to remove the _ocr at the end of the filename.