UB-Mannheim / ocr-fileformat

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
https://digi.bib.uni-mannheim.de/ocr-fileformat/
MIT License
176 stars 23 forks source link

[feature request] Support TSV format #181

Open stweil opened 5 months ago

stweil commented 5 months ago

Is there a need to add support for the TSV format to ocr-fileformat? https://github.com/qurator-spk/page2tsv provides conversion from PAGE XML to TSV. Maybe it is sufficient to know that and use it separately.