UB-Mannheim / ocr-fileformat

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
https://digi.bib.uni-mannheim.de/ocr-fileformat/
MIT License
176 stars, 23 forks

update textract2page #177

Closed: bertsky closed this 6 months ago

bertsky commented 6 months ago

This update is needed because of changes in OCR-D: there is no ocrd_utils distribution anymore.

It also provides much better support for tables.