kba / hocr-spec

The hOCR Embedded OCR Workflow and Output Format
http://kba.github.io/hocr-spec/1.2/

Inline Representation: Sections by function, not class #71

Closed kba closed 7 years ago

kba commented 7 years ago

51

amitdo commented 7 years ago

<sub> and <sup> in HTML 4.01: https://www.w3.org/TR/html401/struct/text.html#h-9.2.3

kba commented 7 years ago

I just find the HTML5 standard way better. I know we have

"all tags should be used for the intended purpose (and only for the intended purpose) as defined in the [HTML40] spec."

in there, but we should rather change that than link to an old spec with bad examples:

      H<sub>2</sub>O
      E = mc<sup>2</sup>
      <SPAN lang="fr">M<sup>lle</sup> Dupont</SPAN>

The first two should not use sub/sup at all. None of the tags should be upper-case.
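
For illustration, the third example with lower-case tags could look like the line below. This is only a sketch of the casing point, not markup quoted from either spec:

      <span lang="fr">M<sup>lle</sup> Dupont</span>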