fcheng00 / tesseract-ocr

Automatically exported from code.google.com/p/tesseract-ocr
Other
0 stars 1 forks source link

hOCR ocr-capabilities sometimes incomplete #1377

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?

1.Run tesseract with tessedit_create_hocr 1

2.Run tesseract with both tessedit_create_hocr 1 and hocr_font_info 1

3. Compare the 'ocr-capabilities list

What is the expected output? What do you see instead?

A sample output is temporarily available at http://teksty.klf.uw.edu.pl/13/

For the first run the list is missing  ocrp_lang, ocrp_dir and ocrp_wconf.

What version of the product are you using? On what operating system?

tesseract 3.04.00 compiled from git on 8 November, Debian sid.

Original issue reported on code.google.com by jsb...@mimuw.edu.pl on 9 Nov 2014 at 5:41

GoogleCodeExporter commented 9 years ago
Can you provide input images?

Original comment by zde...@gmail.com on 7 Feb 2015 at 7:17

GoogleCodeExporter commented 9 years ago
Just uploaded to http://teksty.klf.uw.edu.pl/13/

Original comment by jsb...@mimuw.edu.pl on 7 Feb 2015 at 8:00