gnewtothis101 / tesseract-ocr

Automatically exported from code.google.com/p/tesseract-ocr
Other
0 stars 0 forks source link

Tesseract recognizes the characters irrespective of the lines #1306

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. Run the Tesseract OCR in Java for the attached image 
2. Save the OCR result in a text file
3. Check the order of the output text file with the attached image.

What is the expected output? What do you see instead?
Expected output -- Expected the result with words in the horizontal left to 
right order.

Actual output   -- Showing words randomly irrespective of the line order.

What version of the product are you using? On what operating system?
Tesseract 3.01 and Windows 7 

Please provide any additional information below.
The input and expected & actual output are attached for reference.

Original issue reported on code.google.com by smdk...@gmail.com on 9 Sep 2014 at 6:40

Attachments:

GoogleCodeExporter commented 9 years ago
we do not support for "running tesseract OCR in Java".
Please read FAQ[1] before posting issue.

[1] https://code.google.com/p/tesseract-ocr/wiki/FAQ#Rules_and_advices

Original comment by zde...@gmail.com on 9 Sep 2014 at 11:29