OCR-D / page-to-alto

Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)
Apache License 2.0

convert_text: ignore non-text regions #26

Closed: bertsky closed this 2 years ago

bertsky commented 2 years ago

Fixes #25

bertsky commented 2 years ago

1699189

sorry about that!

(We should also create a PR to ocr-fileformat after merging here...)