UB-Mannheim / ocr-fileformat

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
https://digi.bib.uni-mannheim.de/ocr-fileformat/
MIT License

Add PAGE -> hOCR conversion by @mhug #86

Closed by kba 5 years ago

kba commented 5 years ago

@mhug has created a little XSLT to convert PAGE to hOCR.

@wrznr @zuphilip @stweil

zuphilip commented 5 years ago

The Travis build did not complete because Saxon could not be downloaded (?):

wget --progress=bar:force --no-verbose -O "SaxonHE9-8-0-1J.zip" "https://sourceforge.net/projects/saxon/files/Saxon-HE/9.8/SaxonHE9-8-0-1J.zip/download"

No output has been received in the last 10m0s, this potentially indicates a stalled build or something wrong with the build itself.
Check the details on how to adjust your build configuration on: https://docs.travis-ci.com/user/common-build-problems/#Build-times-out-because-no-output-was-received

The build has been terminated

stweil commented 5 years ago

A direct link to a mirror such as https://netcologne.dl.sourceforge.net/project/saxon/Saxon-HE/9.8/SaxonHE9-8-0-1J.zip might work better.

zuphilip commented 5 years ago

Okay, I changed the URL as @stweil suggested and Travis is now happy again. I will merge this now. Thank you all for the work!
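
The stalled download discussed in this thread could also be guarded against in the build script itself. The following is only a sketch, not the change that was actually merged: it tries a list of URLs in order and gives up on each one quickly instead of hanging until Travis kills the build. The function name `fetch_first`, the `DOWNLOAD` override, and the timeout and retry values are assumptions made for illustration; the two URLs in the usage comment are the ones quoted above.

```shell
#!/bin/sh
# Sketch only: download a file from the first URL that works.
# fetch_first and DOWNLOAD are hypothetical names, not part of ocr-fileformat.

# Default downloader: wget with a 60 s network timeout and 2 tries per URL,
# so a stalled mirror fails fast instead of producing no output for 10 minutes.
# The last option is -O, so the output file name must follow directly.
: "${DOWNLOAD:=wget --no-verbose --timeout=60 --tries=2 -O}"

fetch_first() {
    out=$1
    shift
    for url in "$@"; do
        # $DOWNLOAD is left unquoted on purpose so its options split into words.
        if $DOWNLOAD "$out" "$url"; then
            echo "downloaded from $url"
            return 0
        fi
        echo "failed: $url" >&2
    done
    return 1
}

# Usage (mirror first, generic SourceForge link as fallback):
#   fetch_first "SaxonHE9-8-0-1J.zip" \
#     "https://netcologne.dl.sourceforge.net/project/saxon/Saxon-HE/9.8/SaxonHE9-8-0-1J.zip" \
#     "https://sourceforge.net/projects/saxon/files/Saxon-HE/9.8/SaxonHE9-8-0-1J.zip/download"
```

Keeping the downloader in a variable means the fallback logic can be exercised without network access, and the order of the URLs decides which source is preferred.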