UB-Mannheim / ocr-fileformat

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
https://digi.bib.uni-mannheim.de/ocr-fileformat/
MIT License

Update documentation to reflect latest code #100

Closed (stweil closed 4 years ago)

stweil commented 4 years ago

Signed-off-by: Stefan Weil <sw@weilnetz.de>

stweil commented 4 years ago

I also updated https://digi.bib.uni-mannheim.de/ocr-fileformat/.

stweil commented 4 years ago

> I am just wondering, are you seeing the transformation in this order?

Yes, that's the order with LANG=de_DE.UTF-8. With LANG=C I get a different order, which looks like yours.
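The difference presumably comes from locale-dependent collation: in the C locale, strings are compared by raw byte value, so uppercase ASCII letters sort before lowercase ones, while a locale such as de_DE.UTF-8 applies language-aware rules. Here is a minimal Python sketch of the effect. The names are hypothetical (the actual list is not shown in this thread), and the case-insensitive key is only a rough stand-in for real de_DE.UTF-8 collation, not an exact reproduction of it:

```python
# Hypothetical names for illustration; not the actual list from this thread.
names = ["hocr", "ALTO", "page", "Abbyy"]


def c_locale_order(items):
    # LANG=C compares raw byte values. For ASCII strings this matches
    # Python's default code point ordering: uppercase before lowercase.
    return sorted(items)


def approx_locale_order(items):
    # Assumption: approximate a locale-aware collation by comparing
    # case-insensitively first, then breaking ties with the original string.
    return sorted(items, key=lambda s: (s.casefold(), s))


print(c_locale_order(names))       # ['ALTO', 'Abbyy', 'hocr', 'page']
print(approx_locale_order(names))  # ['Abbyy', 'ALTO', 'hocr', 'page']
```

Pinning the locale (for example, running with LANG=C) when generating such a list should keep the order reproducible across machines.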

zuphilip commented 4 years ago

Merge now. :rocket: Thank you!