GoogleCodeExporter closed 9 years ago
See https://code.google.com/p/tesseract-ocr/wiki/FAQ#Rules_and_advices
Original comment by zde...@gmail.com
on 12 Jul 2013 at 7:43
From the DEV list
On Mon, Jul 15, 2013 at 10:01 AM, Ray Smith <theray.....> wrote:
The idea of shape clustering is that it should help to resolve exactly the errors that you observe! At the moment, though, it doesn't work too well for most languages. It currently should not be used except for the Indic languages, where it does seem to help.
Ray.
On Sun, Jul 14, 2013 at 7:54 PM, Shane Wee <sw......> wrote:
I am using tesseract 3.0.2. I trained my data with shapeclustering included, and the result is not as good as the traineddata I got by excluding shapeclustering.
Shapeclustering seems to cause recognition errors on similarly shaped characters such as 1 and I, O and Q, 5 and S.
I am quite sure I followed the training steps correctly.
My question is whether shapeclustering is really important. If I exclude it from my training, will I miss out on anything important?
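For anyone comparing the two setups, here is a minimal sketch of a training run in which the shapeclustering step is optional. It only builds the command lines and does not execute anything. The language code, font name, and the helper `build_training_commands` are hypothetical, and the tool flags are assumed to match the 3.02-era training instructions, so check them against your installed tools before use.

```python
def build_training_commands(lang, font, use_shapeclustering=False):
    """Return training command lines (argv lists) for one font sample.

    Assumes 3.02-style tool names and flags; nothing is executed here.
    """
    base = f"{lang}.{font}.exp0"  # hypothetical sample name
    tr_file = f"{base}.tr"
    commands = [
        # Generate the .tr feature file from the image and its box file.
        ["tesseract", f"{base}.tif", base, "nobatch", "box.train"],
        # Collect the character set from the box file.
        ["unicharset_extractor", f"{base}.box"],
    ]
    if use_shapeclustering:
        # Optional step; per the reply quoted above, it is currently
        # advised only for the Indic languages.
        commands.append(
            ["shapeclustering", "-F", "font_properties",
             "-U", "unicharset", tr_file]
        )
    commands += [
        ["mftraining", "-F", "font_properties", "-U", "unicharset",
         "-O", f"{lang}.unicharset", tr_file],
        ["cntraining", tr_file],
    ]
    return commands


if __name__ == "__main__":
    # Print the sequence without shapeclustering, one command per line.
    for argv in build_training_commands("eng", "myfont"):
        print(" ".join(argv))
```

Calling `build_training_commands("eng", "myfont", use_shapeclustering=True)` gives the same sequence with the shapeclustering command inserted before mftraining, which makes it easy to train both variants from identical inputs and compare the resulting traineddata.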
Original comment by shreeshrii
on 16 Jul 2013 at 4:16
Thank you so much for the info! :)
Original comment by swe...@gmail.com
on 16 Jul 2013 at 7:05
Original issue reported on code.google.com by
swe...@gmail.com
on 11 Jul 2013 at 4:50