UB-Mannheim / ocr-fileformat

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
https://digi.bib.uni-mannheim.de/ocr-fileformat/
MIT License

Various (simple) Transformations #54

Open zuphilip opened 7 years ago

zuphilip commented 7 years ago

We could look at https://github.com/cneud/ocr-conversion-scripts for this, but we have to be careful about attribution and licensing.