kba / hocr-spec

The hOCR Embedded OCR Workflow and Output Format
http://kba.github.io/hocr-spec/1.2/
72 stars 20 forks source link

Examples in 2.2 Definitions? #90

Open kba opened 7 years ago

kba commented 7 years ago

https://github.com/kba/hocr-spec/pull/84#issuecomment-256723653:

Section 2.2: The abstract description is followed by a specific example with ocr_page, bbox, ocrp_poly. However, it is not yet showing what is described above. Maybe we can extend it to an example with a note, i.e.

An hOCR element (in the following: element) is any HTML tag with a class attribute that contains exactly one class name that starts with ocr_ or ocrx_. Non-OCR related HTML content must not use class names that begin with ocr_ or ocrx_.

Example: <span class="ocr_page"> Note: When referring to an HTML tag with class ocr_page, this spec uses the notation <ocr_page>