raffaeldantas / tesseract-ocr

Automatically exported from code.google.com/p/tesseract-ocr

How to remove small fonts in Images #1305

Closed. GoogleCodeExporter closed this issue 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. Run Tesseract OCR on any platform.
2. Use an image that contains bullets and numbering set in a small font.
3. The output contains the numbering.

What is the expected output? What do you see instead?
Expected output without the numbering. In other words, how can I remove the 
characters set in a small font?

What version of the product are you using? On what operating system?
Tesseract OCR 3.02, Windows 7 64 bit

Please provide any additional information below.

Is there any command to stop Tesseract from reading characters in the small 
font size mentioned above?

Also, is there a way to instruct the OCR engine to read the characters 
horizontally, line by line?

Original issue reported on code.google.com by smdk...@gmail.com on 9 Sep 2014 at 5:32

GoogleCodeExporter commented 9 years ago
Do not use the issue tracker to ask for support; use the user forum for that.
Please read the FAQ [1] before posting an issue.

[1] https://code.google.com/p/tesseract-ocr/wiki/FAQ#Rules_and_advices

Original comment by zde...@gmail.com on 9 Sep 2014 at 11:30
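
For the small-font question raised in this issue, one possible approach is to filter the OCR result afterwards instead of restricting the engine. The sketch below is only an illustration, not a method confirmed anywhere in this thread. It assumes the image was processed with hOCR output enabled (for example `tesseract image.png out hocr`), so that every word is a span of class `ocrx_word` whose `title` attribute carries a `bbox x0 y0 x1 y1` entry. The names `WordCollector` and `drop_small_words` and the `min_height_px` threshold are hypothetical, and the height of a word box is only a rough proxy for font size.

```python
import re
from html.parser import HTMLParser

# Matches the "bbox x0 y0 x1 y1" entry inside an hOCR title attribute.
BBOX_RE = re.compile(r"bbox\s+(\d+)\s+(\d+)\s+(\d+)\s+(\d+)")


class WordCollector(HTMLParser):
    """Collects (text, box_height_px) for each ocrx_word span (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.words = []
        self._height = None   # height of the word currently being read, if any
        self._parts = []      # text fragments of the current word
        self._depth = 0       # nested <span> depth inside the current word

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if self._height is not None:
            # Already inside a word: only track nested spans so that the
            # matching end tag is recognised correctly.
            if tag == "span":
                self._depth += 1
            return
        if tag == "span" and "ocrx_word" in (attrs.get("class") or ""):
            match = BBOX_RE.search(attrs.get("title") or "")
            if match:
                _x0, y0, _x1, y1 = map(int, match.groups())
                self._height = y1 - y0
                self._parts = []
                self._depth = 0

    def handle_data(self, data):
        if self._height is not None:
            self._parts.append(data)

    def handle_endtag(self, tag):
        if self._height is None or tag != "span":
            return
        if self._depth > 0:
            self._depth -= 1
            return
        text = "".join(self._parts).strip()
        if text:
            self.words.append((text, self._height))
        self._height = None


def drop_small_words(hocr_text, min_height_px):
    """Return the words whose bounding box is at least min_height_px tall."""
    parser = WordCollector()
    parser.feed(hocr_text)
    parser.close()
    return [text for text, height in parser.words if height >= min_height_px]
```

A caller would read the hOCR file produced by Tesseract, pass its contents to `drop_small_words`, and pick `min_height_px` by inspecting the box heights of the text to keep. Short lowercase words can have low boxes even in a large font, so the threshold may need tuning per document.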