Closed GoogleCodeExporter closed 9 years ago
I neglected to copy three generated files so they have an "eng" prefix:
cp normproto eng.normproto
cp inttemp eng.inttemp
cp pffmtable eng.pffmtable
There was no Microfeat in my directory, so I concluded it is not needed. After
creating these prefixed fileds, I reran the combine command.
I also determined that I had to deploy the eng.traineddata to
/usr/local/share/tessdata (after copy the existing eng.traineddata that came
with tesseract to preserve a working solution). After deploying
eng.traineddata, I got an an error as follows:
jlpoole@hermes ~/work/tess/samples $ tesseract OCRA_numbers_variety.png output
-l eng
tesseract: unicharmap.cpp:105: bool UNICHARMAP::contains(const char*) const:
Assertion `*unichar_repr != '\0'' failed.
Aborted
jlpoole@hermes ~/work/tess/samples $
Original comment by jlpool...@gmail.com
on 19 Feb 2012 at 12:18
[deleted comment]
tested under tesseract 3.02. attached files which are self explanatory.
It is observed there are misspelling in the name of font in the output text -
even though box file contains correct spelling.Successfully trained Tesseract
to handle OCRA font numeric glyphs except english glyphs. I don't know whether
the expectation of poster is fulfilled.
Original comment by withbles...@gmail.com
on 26 Feb 2012 at 12:13
Attachments:
Since Issue #629 embodies the same problem identified in this Issue #627, I'm
considering this issue closed and am pursuing the matter concerning tesseract
3.02. [Version 681] in Issue #629. I updated my version of tesseract to
today's build and I still had problems. Reference should be made to Issue #629
unless someone advises otherwise.
Thank you.
Original comment by jlpool...@gmail.com
on 26 Feb 2012 at 7:37
reg:"I still had problems" -please elaborate/explain in detail what exact
problems still existed. I like to test after downloading the latest version
r-683 in WinXp.Upload sample text- based on which I can generate tif/box files
myself for testing purpose
and feedback.
Original comment by withbles...@gmail.com
on 27 Feb 2012 at 4:10
When I tried to run tesseract againt a newly built traindata (build 681) I got
this error message instead of output:
jlpoole@themis ~/work/tess/samples_b681 $ tesseract num.ocra.exp0.png output -l
num
tesseract: unicharmap.cpp:105: bool UNICHARMAP::contains(const char*) const:
Assertion `*unichar_repr != '\0'' failed.
Aborted
jlpoole@themis ~/work/tess/samples_b681 $
Original comment by jlpool...@gmail.com
on 27 Feb 2012 at 5:01
[deleted comment]
[deleted comment]
@jlpoole56:
if you have still this problem, please post your files.
Original comment by zde...@gmail.com
on 10 May 2012 at 6:30
I solved my problem in a later bug where I posted a perl script that can be
used to train. This bug may be closed.
Original comment by jlpool...@gmail.com
on 10 May 2012 at 3:24
Original comment by zde...@gmail.com
on 10 May 2012 at 4:29
Original issue reported on code.google.com by
jlpool...@gmail.com
on 18 Feb 2012 at 7:46Attachments: