GoogleCodeExporter closed this issue 9 years ago
I think there is a problem with your font_properties file. It seems to start
with a blank line, whereas the blank line should be at the end.
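As a minimal sketch of the fix described above, the following Python helper drops blank lines from a font_properties file and makes sure the text ends with a single newline. The function name normalize_font_properties and the sample font names are hypothetical, chosen only for illustration; this is not part of Tesseract or jTessBoxEditor.

```python
def normalize_font_properties(text: str) -> str:
    """Drop blank lines and end the text with a single newline.

    Hypothetical helper for illustration; not part of any Tesseract tool.
    """
    # Normalise Windows and old Mac line endings to Unix ones first.
    text = text.replace("\r\n", "\n").replace("\r", "\n")
    # Keep only lines that contain something other than whitespace,
    # trimming trailing whitespace from each kept line.
    lines = [line.rstrip() for line in text.split("\n") if line.strip()]
    # Re-join the entries and terminate the last one with a newline.
    return ("\n".join(lines) + "\n") if lines else ""


if __name__ == "__main__":
    # A file that wrongly starts with a blank line and lacks a final newline.
    broken = "\nmyfont 0 0 0 0 0"
    print(repr(normalize_font_properties(broken)))
```

Opening the file in a text editor and moving the blank line by hand works just as well; the helper is only convenient when several files need the same cleanup.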
I was able to generate the traineddata with your files in jTessBoxEditor (I
needed to add the words list and the frequent words list, and rename the font
properties file to match the naming convention the program requires).
BTW, there is already traineddata for Bangla - please see
https://code.google.com/p/tesseract-ocr/source/browse/ben.traineddata?repo=tessdata
and also see
https://code.google.com/p/tesseract-ocr/source/browse?repo=langdata#git%2Fben
Original comment by shreeshrii
on 30 Mar 2015 at 8:50
No, this will not work unless I leave a blank space in front of the first
line. However, I have the same tif file as input. By the way,
Original comment by m.tawfi...@gmail.com
on 31 Mar 2015 at 2:27
You did not follow the instructions [1]; e.g. font_properties.txt does not meet
the "Requirements for text input files", so I guess you did not create valid
traineddata.
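As far as the "Requirements for text input files" section of [1] goes, the criteria are ASCII or UTF-8 encoding without a BOM, Unix line endings, and a newline as the last character. A rough way to check a file against them might look like the sketch below. The function name check_text_input is hypothetical, and the checks cover only those three criteria, not the per-line font_properties format.

```python
import codecs


def check_text_input(data: bytes) -> list:
    """Return a list of problems found in a training text input file.

    Hypothetical checker for illustration; not part of Tesseract.
    """
    problems = []
    # The file must not begin with a byte order mark.
    if data.startswith(codecs.BOM_UTF8):
        problems.append("starts with a UTF-8 BOM")
    # ASCII is a subset of UTF-8, so one decode covers both encodings.
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        problems.append("not valid ASCII/UTF-8")
    # Any carriage return means Windows or old Mac line endings.
    if b"\r" in data:
        problems.append("contains non-Unix line endings")
    # The very last byte must be an end-of-line marker.
    if not data.endswith(b"\n"):
        problems.append("last character is not a newline")
    return problems


if __name__ == "__main__":
    with open("font_properties", "rb") as handle:  # path is an assumption
        for problem in check_text_input(handle.read()):
            print(problem)
```

An empty result means the file passes these three checks; it does not prove that the resulting traineddata is valid.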
In any case, your issue is invalid: for support you should use the tesseract
user forum. The issue tracker should be used only for reporting problems with
Google-produced traineddata files.
[1] https://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3
Original comment by zde...@gmail.com
on 9 Apr 2015 at 8:06
Original issue reported on code.google.com by
m.tawfi...@gmail.com
on 29 Mar 2015 at 4:03