petermr / norma

Convert XML/SVG/PDF into normalised, sectioned, scholarly HTML
Apache License 2.0

Normalize diacritics #6

Open petermr opened 9 years ago

petermr commented 9 years ago

Add normalization of diacritics, e.g. "e" + "combining acute" ==> "eacute", i.e. U+0065 U+0301 ==> U+00E9 (Unicode defines normalization forms such as NFC for this)
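
A minimal sketch of what this request might look like, assuming Unicode NFC (canonical composition) is the intended behaviour. It is written in Python purely for illustration and is not the project's code; the function name `normalize_diacritics` is hypothetical.

```python
import unicodedata


def normalize_diacritics(text: str) -> str:
    """Compose base letters with following combining marks (Unicode NFC).

    Hypothetical helper for illustration; assumes NFC is the wanted form.
    """
    return unicodedata.normalize("NFC", text)


# "e" followed by U+0301 COMBINING ACUTE ACCENT (two code points)
decomposed = "e\u0301"
composed = normalize_diacritics(decomposed)

print(len(decomposed), len(composed))  # 2 1
print(composed == "\u00e9")            # True (U+00E9, precomposed e-acute)
```

NFC only merges sequences that have a precomposed equivalent; text without combining marks passes through unchanged. If compatibility characters such as ligatures should also be folded, NFKC would be the form to consider instead, which is a separate design decision.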