patcharats / tesseract-ocr

Automatically exported from code.google.com/p/tesseract-ocr
Other
0 stars 0 forks source link

failed to create/build freq-dawg and word-dawg for Kannada language. #51

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. Error: Illegal malloc request size!
Fatal error: No error trap defined!
Signal_termination_handler called with signal 2001
Signal_exit 30 SIGNAL ABORT. LocCode: 3  SignalCode: 3

2. In wordpad typed wordlist of Kannada language and evenhough
saved as "wordlist.txt" under UTF-8 format but failed to 
generate Dictionary Data viz., freq-dawg and word-dawg
3.

What is the expected output? What do you see instead?
when run the following commands:
    wordlist2dawg frequent_words_list freq-dawg
    wordlist2dawg words_list word-dawg
failed to generate/build files of (1) freq-dawg and (2)word-dawg

What version of the product are you using? On what operating system?
tesseract-2.0  OS:MSwindows 

Please provide any additional information below.
In word pad typed word list of kan(Kannada) language and 
saved as "wordlist.txt" under UTF-8 format but failed. It is not clear
how to create word list and whether two files have to be saved as 
(1) "frequent_words_list.txt" and
(2)"words_list.txt" under format UTF-8 in Notepad?
Screenshots of (sample)word list created for eng (English)will do.

Original issue reported on code.google.com by withbles...@gmail.com on 5 Aug 2007 at 5:53

GoogleCodeExporter commented 9 years ago
frequent_words_list or words_list has to be replaced with your worlist files.

>> e.g. wordlist2dawg wordlist.txt freq-dawg
>> e.g. wordlist2dawg wordlist.txt word-dawg

if you have a bigger wordlist file split into two files:

1. freqlist.txt which contains often used words
and
2. wordlist.txt which contains the rest

Original comment by struther...@gmail.com on 7 Aug 2007 at 11:28

GoogleCodeExporter commented 9 years ago
Thanks. Problems solved.  The issue may please be treated closed.

Original comment by withbles...@gmail.com on 2 Sep 2007 at 5:10

GoogleCodeExporter commented 9 years ago

Original comment by theraysm...@gmail.com on 28 Dec 2008 at 6:37