AmitGorvadiya / tesseract-ocr

Automatically exported from code.google.com/p/tesseract-ocr
Other
0 stars 0 forks source link

new trained language - unable to load unicharset file error #292

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
I am getting the following error message every time I try to use the new
language I created: "Unable to load unicharset file ./tessdata/abc.unicharset"

This is for my own trained language not a pre-made one (so, it is not a
matter of downloading source training data).

I have meticulously gone through the entire training process multiple
times.  I never get any problems until the end when I try to use it.  Yes,
the file is there but it won't load it for some reason.  I even tried it
with an entirely different tif and box (one far more simple than the first
one I used), and I still get the same problem.  Any unicharset file I
create simply will not load.  I am at a total loss as to how to resolve
this.  I am sure I am missing something.

I have attached the files for the simpler one I created.  If anyone has any
suggestions, I would appreciate it.

Thanks.

What steps will reproduce the problem?
1. follow steps to train tesseract for a new language
2. try running tesseract image1.tif output -l abc
3.

What is the expected output? What do you see instead?
an output txt file of the document.  an error message.

What version of the product are you using? On what operating system?
2.04 on vista 64

Please provide any additional information below.

Original issue reported on code.google.com by wilsonfi...@gmail.com on 11 Apr 2010 at 1:25

Attachments:

GoogleCodeExporter commented 9 years ago
Recently, I am having the same error message with pre-made english and 
portuguese languages for 2.04. Both on Win 7 x86 and x64. Maybe some windows 
update related issue?

Version 3 works fine, but the recognition is worst in my case

Original comment by gab...@gmail.com on 14 Feb 2011 at 1:03

GoogleCodeExporter commented 9 years ago
"Unable to load unicharset file ./tessdata/abc.unicharset" means tesseract is 
not able to open "abc.unicharset" file and it expect it is located in 
"./tessdata" (e.g. in subdirectory tessdata in current directory).

Please check your instalation or try to set TESSDATA_PREFIX variable.

Original comment by zde...@gmail.com on 14 Feb 2011 at 6:54

GoogleCodeExporter commented 9 years ago
can we close issue or do you have some other information?

Original comment by zde...@gmail.com on 27 Jul 2011 at 9:14

GoogleCodeExporter commented 9 years ago
you can close it.

Original comment by wilsonfi...@gmail.com on 27 Jul 2011 at 9:18

GoogleCodeExporter commented 9 years ago

Original comment by zde...@gmail.com on 27 Jul 2011 at 9:29