jacklicn / tesseract-ocr

Automatically exported from code.google.com/p/tesseract-ocr
Other
0 stars 0 forks source link

font_properties #475

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?

1.According to wiki instruction = where <fontname> is a string naming the font 
(no spaces allowed!), and <italic>, <bold>, <fixed>, <serif> and <fraktur> are 
all simple 0 or 1 flags indicating whether the font has the named property."  
With reference to  "no spaces allowed!" how to do in the following cirumstances?

for example;  Some of fonts containing spaces  under "format" are as follow

Arial Black,  BRH kannada,  BRH Bengali,  BRH Bengali RN,  Comic  Sans MS,  MS 
Serif, 
Times New Roman
It could clearly be seen that there are spaces in the font  resulting 
mftraining is icrashed with error 
as "!  ... has no defined properties. !missing font_properties  entry is fatal 
error!"
Further only <regular> <bold> <italic> <bolditalic> are available but <serif> 
<fixed> <fraktur>
are not available.
Early clarification is requested.unable to generate traineddata due to failure 
of mftraining, All types of method like BRH_Kannada etc used but still not 
succeeded.

What version of the product are you using? On what operating system?
tesseract OCR 578 WinXP(with sp3)

Please provide any additional information below.
Never faced such problems in the previous versions. Requested to clarify the 
position with example of commandline.  

Original issue reported on code.google.com by withbles...@gmail.com on 8 Apr 2011 at 11:12

GoogleCodeExporter commented 9 years ago
Why you can not use "ArialBlack" or "Arial_Black" instead of "Arial Black"? The 
space in name is useless for training or OCR...

Original comment by zde...@gmail.com on 13 Jun 2011 at 7:26