ocropus / hocr-tools

Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.

Output hocr files besides images in hocr-cut #156

Open zuphilip opened 5 years ago

zuphilip commented 5 years ago

[hocr-cut] gave me a myimage.left.jpg and a myimage.right.jpg. I was primarily expecting two hOCR files, one for each half (to be merged later with the images to make the hocr-pdf).

Originally posted by @sarangtc in https://github.com/tmbdev/hocr-tools/issues/154#issuecomment-526940809

zuphilip commented 5 years ago

Yes, it currently outputs only the separated images. The script could be improved so that it also outputs one hOCR file for the left part and one for the right part, as discussed before; see https://github.com/tmbdev/hocr-tools/pull/108#issuecomment-419455029 . If anyone wants to work on this feature, let me know. Alternatively, you can OCR the left and right images separately, which will probably give better results.
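
For anyone picking this up, here is a rough sketch of what splitting the hOCR could look like. It is not the existing hocr-cut code. It assumes the cut position is already known as an x coordinate (`cut_x`), that the input is well-formed XHTML without a namespace, and that words are marked as `ocrx_word` with a `bbox` in their `title` attribute. The function names `split_hocr`, `get_bbox` and `shift_bbox` are made up for illustration. Words are assigned to a half by the centre of their bounding box, and coordinates in the right half are shifted so they are relative to the right image.

```python
import re
import xml.etree.ElementTree as ET

# Matches the hOCR bounding box property: "bbox x0 y0 x1 y1"
BBOX_RE = re.compile(r"bbox (\d+) (\d+) (\d+) (\d+)")


def get_bbox(elem):
    """Return (x0, y0, x1, y1) from the title attribute, or None if absent."""
    match = BBOX_RE.search(elem.get("title", ""))
    return tuple(int(v) for v in match.groups()) if match else None


def shift_bbox(elem, dx):
    """Move the element's bbox dx pixels to the left, clamping at 0."""
    def repl(match):
        x0, y0, x1, y1 = (int(v) for v in match.groups())
        return f"bbox {max(0, x0 - dx)} {y0} {max(0, x1 - dx)} {y1}"

    elem.set("title", BBOX_RE.sub(repl, elem.get("title", ""), count=1))


def split_hocr(hocr_text, cut_x):
    """Return (left_hocr, right_hocr) as strings (hypothetical helper).

    Only ocrx_word elements are filtered and shifted; the bboxes of
    pages, paragraphs and lines are left untouched in this sketch.
    """
    halves = {}
    for side in ("left", "right"):
        root = ET.fromstring(hocr_text)  # fresh copy of the tree per half
        for parent in list(root.iter()):
            for child in list(parent):
                if child.get("class") != "ocrx_word":
                    continue
                bbox = get_bbox(child)
                if bbox is None:
                    continue  # no coordinates, keep the word in both halves
                centre_x = (bbox[0] + bbox[2]) / 2
                on_left = centre_x < cut_x
                if on_left != (side == "left"):
                    parent.remove(child)  # word belongs to the other half
                elif side == "right":
                    shift_bbox(child, cut_x)
        halves[side] = ET.tostring(root, encoding="unicode")
    return halves["left"], halves["right"]
```

A real implementation would also need to recompute the line and page bounding boxes, drop lines that end up empty, and decide what to do with words that straddle the cut.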