UB-Mannheim / ocr-fileformat

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
https://digi.bib.uni-mannheim.de/ocr-fileformat/
MIT License
181 stars, 22 forks

Transform ALTO/hocr to plain text #34

Closed (kba, closed 8 years ago)