Closed GoogleCodeExporter closed 9 years ago
After testing with different 3 or 4 sample.tif, same problems still exists. In
other words, command line "tesseract xyz.tif xyz batch.nochop makebox" failed to
generate boxes(100%)for full/complete set of fonts image - instead generate
only
boxes(25% to 40%) instead of expected 100% boxes of the fonts of image.
It is felt there must be some bugs in relevant source codes of "batch.nochop
makebox"
- which requires detailed investigation.
Once tesseractOCR succeeded to generate 100% boxes with reference to fonts
image at
initial stage,easily one can generate 8 data files of relevant languages
without any
problems.
Original comment by withbles...@gmail.com
on 4 Sep 2007 at 7:09
These characters are too small to use as training data, or for recognition. You
should be training with characters 20-30 pixels high, equivalent to about 10pt
at 300
dpi. That is 30-40pt at 75-100 dpi screen resolution. This problem is a
duplicate of
issue 61.
Original comment by theraysm...@gmail.com
on 6 Sep 2007 at 12:51
Original issue reported on code.google.com by
withbles...@gmail.com
on 26 Aug 2007 at 5:16Attachments: