Currently, the
http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract wiki page
mentions that:
"There is a visual basic tool that you can use (windows only) to make box
file creation much easier. See
http://groups.google.com/group/tesseract-ocr/files and look for
bbtesseract. You can also check out this thread:
http://groups.google.com/group/tesseract-ocr/browse_thread/thread/2321deb561450e76/554c7a8cec11c073#554c7a8cec11c073
in the forum for more information. Thanks to unkowner for contributing this."
However, on Linux and other systems that have Python and the required
modules installed, one can use the excellent GUI tool tesseractTrainer.py,
contributed in November 2007 by Catalin Francu. It is available in the
files section of the tesseract-ocr Google group:
http://groups.google.com/group/tesseract-ocr/files
There's even a screenshot available:
http://tesseract-ocr.googlegroups.com/web/tesseractTrainer.png
I'm attaching the currently available version of tesseractTrainer.py here
in case something happens to the copy hosted on the group.
Original issue reported on code.google.com by aleksand...@gmail.com on 18 Jul 2008 at 2:29