Open OlivierBinette opened 3 years ago
Write a class of functions of the form hocr_* to operate on hocr files.
hocr_*
We have a few functions of the form hocr_from_* to generate hocr files already.
hocr_from_*
Example additional functionality from hocr-tools:
hocr_parse
hocr_words
hocr_paragraphs
hocr_lines
hocr_combine
hocr_split
Write a class of functions of the form
hocr_*
to operate on hocr files.We have a few functions of the form
hocr_from_*
to generate hocr files already.Example additional functionality from hocr-tools:
hocr_parse
orhocr_words
: return the complete hocr file as a tidy tablehocr_paragraphs
: return hocr as a tidy table of paragraphs, ignoring line and word-specific informationhocr_lines
: same as above, but for paragraphshocr_combine
to combine multiple hocr files into onehocr_split
to split hocr files into individual pages