thvitt closed this issue 7 years ago
Certain hyphenation tables use encoding other than ISO-8859-1. To facilitate translation from that particular encoding to UCS, a list of codes and their unicode values can be passed to the hyphenator. See ruhyphal.tex, koicodes.txt for an example of a KOI8-R-encoded hyphenation table and a list of codes. [TeXHyph-J]
HyphenAnnotator initializes this to a 256-byte 1:1 table, but we ship UTF-8 encoded files, so characters with code points above 255 are not covered by it.
We should probably just ignore that table by default.
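To illustrate the mismatch, here is a minimal sketch, not the actual HyphenAnnotator or hyphenator code. The class `CodeTableSketch` and its methods `identityTable` and `translate` are hypothetical names made up for this example. It assumes the code table is a plain `int[]` indexed by character code, and shows one way to leave characters outside the table untouched, which for already-decoded UTF-8 input amounts to ignoring the table.

```java
public class CodeTableSketch {

    /** Builds a 256-entry identity table: entry i maps to code point i. */
    public static int[] identityTable() {
        int[] table = new int[256];
        for (int i = 0; i < table.length; i++) {
            table[i] = i;
        }
        return table;
    }

    /**
     * Translates cc through the table when it is in range.
     * Codes outside the table (or a missing table) are returned unchanged,
     * on the assumption that the input is already Unicode.
     */
    public static int translate(int[] table, int cc) {
        if (table == null || cc < 0 || cc >= table.length) {
            return cc;
        }
        return table[cc];
    }

    public static void main(String[] args) {
        int[] table = identityTable();
        int cc = 0x0153; // U+0153, the "oe" ligature from the French patterns
        // An unguarded table[cc] would throw ArrayIndexOutOfBoundsException here,
        // because 339 is past the end of a 256-entry array.
        System.out.println(translate(table, cc));
    }
}
```

With this guard, a 1:1 table and no table at all behave the same for every input, which is consistent with simply ignoring the table by default.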
Fixed in 817586b
When loading the French hyphenation file, the parser fails with this stack trace:
The relevant line fails with
cc == 339 == 0x0153 (œ)