Closed fremont444 closed 2 years ago
Anyone found a solution to the 'long s' problem when OCR-ing early French texts? i.e. 'long s' comes out as an 'f'
If you copy and paste text from pdfs in Okular this problem disappears. Anyone know why?
This is related to tesseract resp. the tessdatas, gImageReader is just a front-end and does not do any recognition itself.
Anyone found a solution to the 'long s' problem when OCR-ing early French texts? i.e. 'long s' comes out as an 'f'
If you copy and paste text from pdfs in Okular this problem disappears. Anyone know why?