qixiaobo / tesseract-ocr

Automatically exported from code.google.com/p/tesseract-ocr
Other
0 stars 0 forks source link

post processing #429

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
Is there any provision for post processing the output, based on textual
data file, using commands of tesseract? This is very necessary for
Indian languages as training has not reached high levels.

Original issue reported on code.google.com by mns...@gmail.com on 15 Jan 2011 at 2:35

GoogleCodeExporter commented 9 years ago
Tesseract output can be improved (in some extent) by provided dictionaries (see 
http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3#Dictionary_Data_(
Optional) ). Other post-processing you need to do outside of tesseract

Original comment by zde...@gmail.com on 21 Jul 2012 at 4:28