tesseract-ocr / tesstrain

Train Tesseract LSTM with make
Apache License 2.0
626 stars 180 forks source link

make: *** [data/unicharset] Error #20

Closed engahmed1190 closed 6 years ago

engahmed1190 commented 6 years ago

Hi @kba

I have tried the code of tried

combine_tessdata -u /mnt/e/projects/Training_Tesseract/ocrd-train/usr/share/tessdata/foo.traineddata  /mnt/e/projects/Training_Tesseract/ocrd-train/usr/share/tessdata/foo.
Failed to read /mnt/e/projects/Training_Tesseract/ocrd-train/usr/share/tessdata/foo.traineddata
Makefile:98: recipe for target 'data/unicharset' failed

I haven't seen any file called foo.traineddata in the tessdata directory. what do i need to do here

when I changefoo.traineddata with eng.traineddata . this command works fine which is expected as combine_tessdata -u extract exiting traineddata Thanks in advance

Mayshinlyan commented 6 years ago

Hi there,

I am also facing similar issues with Makefile:98. Can someone help please?

capture
kba commented 6 years ago

Problem was that change introduced by #15 prevented creation of unicharset from scratch. For now, we do not set CONTINUE_FROM by default and create unicharset differently depending on whether CONTINUE_FROM is set.

@wrznr has some ideas how to improve that, hopefully we'll get around to implementing them soon.

nebiyebln commented 3 years ago

@kba hi please help me , I'm making a license plate recognition system. I will train tesseract to better read license plate characters. I created a data set for this. There are .tif and .txt files. but when I start the tutorial I get this error: unicharset_extractor: symbol lookup error: unicharset_extractor: undefined symbol: _ZN10UNICHARSET11null_scriptE Makefile:103: recipe for target 'data/unicharset' failed make: * [data/unicharset] Error 127

wrznr commented 3 years ago

@nebiyebln Pls. open an issue for your problem with a more detailed description of your problem, ideally in combination with a sample dataset which helps us to reproduce the problem.