Tesseract recommends a minimum dpi of 300 (here) and the default pdftoppm dpi is 150. I experienced poor accuracy on some documents and increasing the dpi fixed the issue. This could be controllable by a keyword argument, but the tesseract recommended setting seems like a better default. Love this package, it's a lifesaver.
Tesseract recommends a minimum dpi of 300 (here) and the default pdftoppm dpi is 150. I experienced poor accuracy on some documents and increasing the dpi fixed the issue. This could be controllable by a keyword argument, but the tesseract recommended setting seems like a better default. Love this package, it's a lifesaver.