tesseract-ocr / tesseract

Tesseract Open Source OCR Engine (main repository)
https://tesseract-ocr.github.io/
Apache License 2.0
62.18k stars 9.5k forks source link

Error in Shell script for some Linux commands #2653

Open ghost opened 5 years ago

ghost commented 5 years ago

I am training tesseract 4 with 6 fonts and near about 50K images and box files. While running the generate_training_data.sh script, i am getting two errors first one is for cp command and other one is for ls command (argument list too long). What is the reason for such errors and what would be the solution. Is it because of large data?

Shreeshrii commented 5 years ago

Try setting

ulimit -s 65536

and see if that helps.

stweil commented 5 years ago

@mayurakewarv2, where did you get generate_training_data.sh from? I don't think it is part of tesseract-ocr/tesseract.