justaddcoffee / tesseract-ocr

Automatically exported from code.google.com/p/tesseract-ocr
Other
0 stars 0 forks source link

mftraining: intproto.cpp:602: INT_TEMPLATES_STRUCT* tesseract::Classify::CreateIntTemplates(CLASSES, const UNICHARSET&): Assertion `((IntTemplates)->Class[ClassId] == __null)' failed. #958

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
Feeding in multi character coordinates I guess(not sure because this is the 
first time I am encountering this error).

What is the expected output? What do you see instead?
I expect a done message and generation of the inttemp file.

What version of the product are you using? On what operating system?

I am using Tesseract 3.02 on ubuntu 12.04 lts.

Please provide any additional information below.

I have trained tesseract with my hindi training data and I have named the 
language as hen.
I have made the box files contain the coordinates of simple words instead of 
individual characters to avoid overlapping of boxes. 
I have created the shapeteble using the "shapeclustering" command.
When I proceed to the next step of generating the inttemp using "mftraining" 
the following happens :
/usr/share/tesseract-ocr/tessdata$ mftraining -F font_properties -U unicharset 
-O hen.unicharset hen.lohithindi.exp*.tr
Read shape table shapetable of 13802 shapes
Reading hen.lohithindi.exp0.tr ...
Reading hen.lohithindi.exp10.tr ...
Reading hen.lohithindi.exp1.tr ...
Reading hen.lohithindi.exp2.tr ...
Reading hen.lohithindi.exp3.tr ...
Reading hen.lohithindi.exp4.tr ...
Reading hen.lohithindi.exp5.tr ...
Reading hen.lohithindi.exp6.tr ...
Reading hen.lohithindi.exp7.tr ...
Reading hen.lohithindi.exp8.tr ...
Reading hen.lohithindi.exp9.tr ...
mftraining: intproto.cpp:602: INT_TEMPLATES_STRUCT* 
tesseract::Classify::CreateIntTemplates(CLASSES, const UNICHARSET&): Assertion 
`((IntTemplates)->Class[ClassId] == __null)' failed.
Aborted (core dumped)

Please help me out.
Thanks

I have attached the unicharset file. I am not able to attach other files due to 
size issues.

Original issue reported on code.google.com by abhay...@gmail.com on 25 Jul 2013 at 4:31

GoogleCodeExporter commented 9 years ago

Original comment by abhay...@gmail.com on 25 Jul 2013 at 4:33

Attachments:

GoogleCodeExporter commented 9 years ago
You can use some other online service for sharing rest of files - and post here 
just link to them (please zip/rar/... them first)

Original comment by zde...@gmail.com on 25 Jul 2013 at 3:14

GoogleCodeExporter commented 9 years ago
Hi I have reworked the process and I have all the files required.
The only change is that I have merged all data into a single file.
https://docs.google.com/file/d/0B0B7wTToCL4LbndFYlBaSEdIUzQ/edit?usp=sharing

Original comment by abhay...@gmail.com on 25 Jul 2013 at 4:50