Open ryanfb opened 9 years ago
For `hlig` support, it seems like we would need to modify Tesseract's `text2image` to support a new argument, e.g. `--opentype_features`, that could call `pango_ot_ruleset_add_feature` with the corresponding `PangoOTTag`s (like Pango's `syriac-fc.c` does).
Passing `--ligatures` to `text2image` gives us st ligatures in Cardo and EB Garamond, as well as ffi ffl (more?) in Garamond. If we could find a way to conditionally turn on the `hlig` OpenType font feature in Garamond, we should be able to get ct ligatures out of it (and maybe Cardo?).

Wyld has ligatures mapped onto the following Latin-1 (extended ASCII) characters:
ÌËÊÉÈÇÅÄÃÂÁÀ
So we'd need to run `text2image` against these, then substitute them back to the characters we want before running training.
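
A minimal sketch of that substitution step, assuming the ground-truth text is rewritten after rendering. The mapping below is hypothetical: this issue does not say which ligature each Wyld placeholder stands for, so the two entries and the name `restore_ligatures` are illustrative only and would need to be filled in from the font itself.

```python
# Hypothetical mapping from Wyld placeholder characters to the text they
# should become in the training ground truth. The real assignments are NOT
# given in this issue; these two entries are examples only.
WYLD_PLACEHOLDERS = {
    "À": "ct",
    "Á": "st",
}


def restore_ligatures(text, mapping=WYLD_PLACEHOLDERS):
    """Replace each placeholder character with the string it stands for."""
    # str.maketrans accepts a dict of single characters to arbitrary strings.
    return text.translate(str.maketrans(mapping))


if __name__ == "__main__":
    print(restore_ligatures("aÀ and moÁ"))
```

Characters that are not in the mapping pass through unchanged, so the same function can be run over a whole ground-truth file once the real mapping is known.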