jacklicn / tesseract-ocr

Automatically exported from code.google.com/p/tesseract-ocr
Other
0 stars 0 forks source link

Offset after combine_tessdata comand ?! ... #527

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1.combine_tessdata
2.
3.

What is the expected output? What do you see instead?

[lang].traineddata

What version of the product are you using? On what operating system?
Tesseract 3.01 on CentOS

Please provide any additional information below.
After I run the command : combine_tessdata, appear this:

Combining tessdata files
TessdataManager combined tesseract data files.
Offset for type 0 is -1
Offset for type 1 is 108
Offset for type 2 is 1407
Offset for type 3 is 2021
Offset for type 4 is 311249
Offset for type 5 is 311437
Offset for type 6 is -1
Offset for type 7 is 317891
Offset for type 8 is -1
Offset for type 9 is 317941
Offset for type 10 is -1
Offset for type 11 is -1
Offset for type 12 is -1

Can anyone to tell me which file represent every type from offset ?

Thank you very much!

Original issue reported on code.google.com by simion.zafiu on 4 Aug 2011 at 3:00

GoogleCodeExporter commented 9 years ago
first of all: training for 3.01 (or changes for 3.01) is not documented. If you 
try to train for 3.01 - you should be ready to experiment, test and read the 
code.

Next: list of files is here: 
http://code.google.com/p/tesseract-ocr/source/browse/trunk/ccutil/tessdatamanage
r.h#29
"-1" indicate that file is not present.

Last: please do not use issue tracker for questions. There is tesseract forum 
for it.

Original comment by zde...@gmail.com on 5 Aug 2011 at 6:30