AmitGorvadiya / tesseract-ocr

Automatically exported from code.google.com/p/tesseract-ocr

Error in pixReadTiff: pix not read #307

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
When I run:
/usr/local/bin/tesseract eurotext.tif output

I get this message:

Tesseract Open Source OCR Engine with Leptonica
Warning in pixReadStreamTiff: tiff page 1 not found
Error in pixReadTiff: pix not read

eurotext.tif (part of tesseract) has only one page.

Tesseract still produced the output, so the error message is misleading.

Original issue reported on code.google.com by zde...@gmail.com on 26 May 2010 at 6:57

GoogleCodeExporter commented 9 years ago
Tested with tesseract r370 on Mandriva Linux release 2010.1 (Official) for x86_64

Original comment by zde...@gmail.com on 26 May 2010 at 7:44

GoogleCodeExporter commented 9 years ago
I'll look into it; I believe you had another, similar error with a multipage tiff file?

Original comment by joregan on 26 May 2010 at 8:33

GoogleCodeExporter commented 9 years ago

Original comment by joregan on 26 May 2010 at 9:15

GoogleCodeExporter commented 9 years ago
Fixed in revision 373.

Original comment by joregan on 26 May 2010 at 11:08

GoogleCodeExporter commented 9 years ago
For a one-page TIFF this patch works OK, but when I created a multipage TIFF:
   convert *.tif multi.tif

   /usr/local/bin/tesseract multi.tif output

it complains:

Tesseract Open Source OCR Engine with Leptonica
Page 1
Warning in pixReadStreamTiff: tiff page 2 not found
Error in pixReadTiff: pix not read

output.txt contains both pages ;-)

Original comment by zde...@gmail.com on 27 May 2010 at 6:49
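
The log above (page 1 is processed, then a warning about page 2) looks like what a reader would print if it kept asking for the next page until a read failed. That is an inference from the output, not something this thread confirms. One way to avoid probing past the last page is to count the pages first. The sketch below is a minimal Python illustration of that idea for classic TIFF files. It is not the change made in r373 or in Leptonica, and `count_tiff_pages` and `make_minimal_tiff` are hypothetical helper names.

```python
import struct


def count_tiff_pages(data: bytes) -> int:
    """Count the IFDs (pages) in the main IFD chain of a classic TIFF."""
    if len(data) < 8:
        raise ValueError("too short to be a TIFF")
    if data[:2] == b"II":
        endian = "<"  # little-endian
    elif data[:2] == b"MM":
        endian = ">"  # big-endian
    else:
        raise ValueError("not a TIFF: bad byte-order mark")
    magic, offset = struct.unpack(endian + "HI", data[2:8])
    if magic != 42:
        raise ValueError("not a classic TIFF (BigTIFF is not handled here)")

    pages = 0
    seen = set()  # guards against a looping IFD chain
    while offset != 0:
        if offset in seen or offset + 2 > len(data):
            raise ValueError("corrupt IFD chain")
        seen.add(offset)
        (n_entries,) = struct.unpack_from(endian + "H", data, offset)
        # Each IFD entry is 12 bytes; the next-IFD offset follows the entries.
        next_ptr = offset + 2 + 12 * n_entries
        if next_ptr + 4 > len(data):
            raise ValueError("truncated IFD")
        (offset,) = struct.unpack_from(endian + "I", data, next_ptr)
        pages += 1
    return pages


def make_minimal_tiff(n_pages: int) -> bytes:
    """Build a little-endian TIFF skeleton with n_pages empty IFDs.

    For demonstration only: real IFDs carry tags, these carry none.
    """
    out = bytearray(b"II" + struct.pack("<HI", 42, 8 if n_pages else 0))
    for i in range(n_pages):
        # Each empty IFD is 6 bytes: 2-byte entry count + 4-byte next offset.
        next_offset = len(out) + 6 if i + 1 < n_pages else 0
        out += struct.pack("<HI", 0, next_offset)
    return bytes(out)


if __name__ == "__main__":
    data = make_minimal_tiff(2)
    # Iterate over exactly the pages that exist, never one past the end.
    for page in range(count_tiff_pages(data)):
        print("Page", page + 1)
```

With a real file, the bytes would come from something like `open("multi.tif", "rb").read()`. A loop bounded by the page count never requests a page that does not exist, so it has no failed read to report.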

GoogleCodeExporter commented 9 years ago
These extraneous warnings will be removed from Leptonica 1.73, and I've already
removed some of them from Debian by patching 1.72.

Original comment by jbrei...@google.com on 21 Jul 2015 at 4:44