tesseract-ocr / langdata_lstm

Data used for LSTM model training
Apache License 2.0
114 stars 152 forks source link

Normalize unicode in texts #26

Closed stweil closed 4 years ago

stweil commented 4 years ago

Signed-off-by: Stefan Weil sw@weilnetz.de