tesseract-ocr / tesstrain

Train Tesseract LSTM with make
Apache License 2.0
637 stars 188 forks source link

Add exception handling for empty GT files #41

Closed wrznr closed 5 years ago

wrznr commented 5 years ago

Right now:

Traceback (most recent call last):
  File "generate_line_box.py", line 41, in <module>
    if not unicodedata.combining(line[-1]):
IndexError: string index out of range
Makefile:111: recipe for target 'data/ground-truth/example.box' failed