Closed jtlz2 closed 5 years ago
The hocr-tools are tools for working with hocr files. A transformation from ALTO to hocr is out-of-scope here, but the main purpose of ocr-fileformat. This transformation should already been supported by ocr-fileformat. Let us know there, if you have any problems with that.
Huge thanks and sorry to pollute - see https://github.com/UB-Mannheim/ocr-fileformat/issues/89 where I have described the problem at hand.
I am trying to use your excellent tools to compare alto files from ABBYY and tesseract, but I haven't found a reliable way to convert the alto into hocr in order to do so.
Do you have any plans to support alto input?
I have tried to get ocr-fileformat to do the conversion - so far in vain.
Thanks for all help