kareemu3 / tesseract-ocr

Automatically exported from code.google.com/p/tesseract-ocr
Other
0 stars 0 forks source link

Cannot use dawg2wordlist with fixed-length dawgs #1308

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?

1. Download chi_tra.traineddata or any other language file that contains a 
fixed-length dawg
2. Unpack it with "combine_tessdata -u chi_tra.traineddata chi_tra."
3. Try to get the word list from the fixed-length dawg "dawg2wordlist 
chi_tra.unicharset chi_tra.fixed-length-dawgs chi_tra.fixed-length-list"

What is the expected output? What do you see instead?

Expected: the file "chi_tra.fixed-length-list including a set of fixed-length 
words
The program crashes on Windows saying it doesn't work anymore. On Linux it 
tells me there's a memory access error.

What version of the product are you using? On what operating system?

Tested in 3.02.02 from the installer on Windows (32 bit), 3.03 on Windows 64 
bit and 3.04 on Linux 64 bit.

Please provide any additional information below.
--

Original issue reported on code.google.com by p.vorb...@gmail.com on 9 Sep 2014 at 4:05

GoogleCodeExporter commented 9 years ago
Fixed length dawgs are deprecated

Original comment by joregan on 13 May 2015 at 5:27