itwood / tesseract-ocr

Automatically exported from code.google.com/p/tesseract-ocr
Other
0 stars 0 forks source link

Different OCR results for windows and CentOS #1497

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
Execute "tesseract pure_text.png pure_text_win" on windows7 and 
"tesseract pure_text.png pure_text_centos" on CentOS6.4

Both window7 and CentOS6.4 have installed tesseract 3.02.02

What is the expected output? What do you see instead?
The two outputs are expected to be identical or very similar. However, they are 
very different: window's result is almost perfect, while the result by CentOS 
is terrible. Please see the attachments for details.

What version of the product are you using? On what operating system?

windows7:
tesseract 3.02
leptonica-1.68 (Mar 14 2011, 10:43:03) [MSC v.1500 LIB Release 32 bit]
libgif 4.1.6 : libjpeg 8c : libpng 1.4.3 : libtiff 3.9.4 : zlib 1.2.5

CentOS6.4
tesseract 3.02.02
leptonica-1.72
 libjpeg 6b (libjpeg-turbo 1.2.1) : libpng 1.2.49 : zlib 1.2.3

Please provide any additional information below.
Tesseract on CentOS is observed to perform worse than windows for many other 
pictures.

Original issue reported on code.google.com by chentaokite on 13 Jul 2015 at 10:09

Attachments:

GoogleCodeExporter commented 9 years ago

Original comment by zde...@gmail.com on 20 Jul 2015 at 8:10