kareemu3 / tesseract-ocr

Automatically exported from code.google.com/p/tesseract-ocr
Other
0 stars 0 forks source link

Shapeclustering command not giving proper output though creates shapetable file. #1315

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. Executed commands 
tesseract test2.times.exp0.tiff test2.times.exp0 batch.nochop makebox

tesseract test2.times.exp0.jpg test2.times.exp0.box  nobatch box.train

unicharset_extractor test2.times.exp0.box

2. Then executed 
shapeclustering -F font_properties -U unicharset -O test2.times.exp0.tr

3. Got the output:
Building master shape table
Computing shape distances...
Stopped with 0 merged, min dist 999.000000
Computing shape distances...
Stopped with 0 merged, min dist 999.000000
Computing shape distances...
Stopped with 0 merged, min dist 999.000000
Computing shape distances...
Stopped with 0 merged, min dist 999.000000
Computing shape distances...
Stopped with 0 merged, min dist 999.000000
Computing shape distances...
Stopped with 0 merged, min dist 999.000000
.......

Master shape_table:Number of shapes = 0 max unichars = 0 number with multiple 
unichars = 0

What is the expected output? What do you see instead?
Something like:
Master shape_table:Number of shapes = 1060 max unichars = 3 number with 
multiple unichars = 16

What version of the product are you using? On what operating system?
I am using Tesseract 3.02.02 on Ubuntu 14.04.

Please provide any additional information below.
Running Tesseract for handwritten like text.

Original issue reported on code.google.com by cool12...@gmail.com on 18 Sep 2014 at 8:58

Attachments:

GoogleCodeExporter commented 9 years ago
I have the same problem in windows 7, any suggestion?

Original comment by summy0...@gmail.com on 11 Nov 2014 at 3:09