mklarz / cipher_identifier

WIP script that tries to identify ciphers available on dcode.fr (symbol ciphers only, for now) based on the input.
https://corax.team

Add image processing before passing the input to Tesseract #3

Open mklarz opened 4 years ago

mklarz commented 4 years ago

We need to improve the quality of the input images before we pass them to Tesseract to increase the chance of correctly guessing the cipher. See https://tesseract-ocr.github.io/tessdoc/ImproveQuality

Also see https://tesseract-ocr.github.io/tessdoc/ImproveQuality#examples
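To make the idea concrete, here is a rough, dependency-free sketch of two of the steps the linked page discusses: binarisation and adding a border. It is not the code from this repository. The function names (`otsu_threshold`, `binarize`, `add_border`) and the list-of-rows image representation are assumptions made for illustration, and Otsu's method is just one possible way to pick the threshold. In practice this would more likely be done with an imaging library such as Pillow or OpenCV before handing the image to Tesseract.

```python
from typing import List, Optional

Gray = List[List[int]]  # rows of 0-255 grayscale pixel values (assumed layout)


def otsu_threshold(image: Gray) -> int:
    """Pick the threshold that maximises between-class variance (Otsu's method)."""
    hist = [0] * 256
    for row in image:
        for px in row:
            hist[px] += 1
    total = sum(hist)
    sum_all = sum(i * h for i, h in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_bg, sum_bg = 0, 0  # pixel count and intensity sum of the "dark" class
    for t in range(256):
        w_bg += hist[t]
        if w_bg == 0:
            continue  # no pixels at or below t yet
        w_fg = total - w_bg
        if w_fg == 0:
            break  # every pixel is already in the dark class
        sum_bg += t * hist[t]
        mean_bg = sum_bg / w_bg
        mean_fg = (sum_all - sum_bg) / w_fg
        var_between = w_bg * w_fg * (mean_bg - mean_fg) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize(image: Gray, threshold: Optional[int] = None) -> Gray:
    """Map pixels at or below the threshold to black (0), the rest to white (255)."""
    t = otsu_threshold(image) if threshold is None else threshold
    return [[0 if px <= t else 255 for px in row] for row in image]


def add_border(image: Gray, size: int = 10, value: int = 255) -> Gray:
    """Pad the image on all sides so symbols do not touch the edge."""
    width = len(image[0]) + 2 * size
    top = [[value] * width for _ in range(size)]
    bottom = [[value] * width for _ in range(size)]
    body = [[value] * size + list(row) + [value] * size for row in image]
    return top + body + bottom


if __name__ == "__main__":
    # Tiny made-up image: dark symbol pixels on a light background.
    sample = [[30, 40, 200], [35, 210, 220]]
    cleaned = add_border(binarize(sample), size=1)
    for row in cleaned:
        print(row)
```

The output of `add_border(binarize(...))` is what would then be converted back into an image and passed to Tesseract. The border size and the choice of a global threshold are guesses and would need tuning against the actual symbol images.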

mklarz commented 3 years ago

The initial commit for this is 940563dcfe2d524140cc49face55f6aababbf8a4. Also related: https://github.com/corax/cipher_identifier/issues/22