kba / hocr-spec

The hOCR Embedded OCR Workflow and Output Format
http://kba.github.io/hocr-spec/1.2/
72 stars 20 forks source link

What's the purpose of ocrx_cinfo? #69

Open kba opened 7 years ago

kba commented 7 years ago

Spec says

  * ocrx_cinfo should nest inside ocrx_line
  * ocrx_cinfo should contain only x_confs, x_bboxes, and cuts attributes

but not what ocrx_cinfo actually is.

amitdo commented 7 years ago

It's not clear whether Tom really wanted both ocr_cinfo and ocrx_cinfo.

kba commented 7 years ago

IIUC:

Since neither ocr_cinfo nor ocrx_cinfo seem to have semantics beyond "can contain character-level coordinates", ocrx_cinfo seems redundant.

amitdo commented 7 years ago

Related, should be rebased: https://github.com/kba/hocr-spec/commit/6fdbbbf28