Closed wkoszycki closed 3 years ago
The exception originated from the native code. HistogramRect
method is defined in Otsu thresholding module otsuthr
. You may want to trace through it to determine the root cause of why some of your images were not consumable by tesseract engine.
@nguyenq thanks I will try to reproduce with pure tesseract and get back to you
@nguyenq I have tried to run via cmd with all available psm options
tesseract -l pol --oem 1 --psm <0-13> input.tif output.txt
no error occurred
To replicate issue and get all options during tess4j execution I set logging.level.net.sourceforge.tess4j=DEBUG
but there were additional logs. Is there a way to get exact info what is being executed underneath ?
Both Tess4J and Tesseract source code is available for your investigation. If you can set up your IDE for native code debugging, you'd be able to step from Tess4J's Java code into Tesseract's C++ code and observe what is going under the hood.
Not reproducible.
During tif files processing folowing fatal error ocurring for some of the files
Versions:
I also set
export LC_ALL=C
tesseract.setOcrEngineMode(1);