Open maddyloo opened 10 years ago
http://www.bitjungle.com/isoent/ has detailed notes on how to convert back and forth between the different formats. I wonder whether we need more than the basic Latin-1 symbols, i.e. will math symbols be involved?
So far, in the legacy documents, the only symbols that are present are these:
Ã
É
Ü
á
à
&
ä
¤
é
è
í
î
ó
ø
õ
ö
ß
ü
We should be future-friendly, but I think covering Latin-1 would be enough.
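A quick way to check the "Latin-1 is enough" assumption against a document is to look for any character above code point 0xFF. This is only a sketch: `SYMBOLS` is the list above pasted into a string, and `outside_latin1` is a hypothetical helper name, not something that exists in the project.

```python
# The symbols found so far in the legacy documents (copied from the list above).
SYMBOLS = "ÃÉÜáà&ä¤éèíîóøõößü"


def outside_latin1(text):
    """Return the characters in `text` that Latin-1 (ISO 8859-1) cannot represent."""
    # Latin-1 covers exactly the code points 0x00 through 0xFF.
    return [ch for ch in text if ord(ch) > 0xFF]


if __name__ == "__main__":
    # An empty list means everything seen so far fits in Latin-1.
    print(outside_latin1(SYMBOLS))
```

Running the same helper over each legacy file would flag math symbols or anything else that falls outside Latin-1, if they ever turn up.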
Possibly relevant: https://docs.python.org/2/library/htmllib.html#module-htmlentitydefs
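In Python 3 the `htmlentitydefs` tables were moved to `html.entities`. Below is a rough sketch of converting in both directions with them, assuming Python 3.4+; `encode_entities` and `decode_entities` are names made up for illustration, and the choice to encode only non-ASCII characters plus `&` is an assumption, not a decision made in this thread.

```python
import html
from html.entities import codepoint2name


def encode_entities(text):
    """Replace '&' and non-ASCII characters with named entities where one exists."""
    out = []
    for ch in text:
        name = codepoint2name.get(ord(ch))
        # Leave '<', '>' and quotes alone so existing markup is not mangled.
        if name is not None and (ch == "&" or ord(ch) > 127):
            out.append("&%s;" % name)
        else:
            out.append(ch)
    return "".join(out)


def decode_entities(text):
    """Turn named and numeric character references back into characters."""
    return html.unescape(text)


if __name__ == "__main__":
    encoded = encode_entities("café & straße")
    print(encoded)
    print(decode_entities(encoded))
```

Note that `encode_entities` would double-encode text that already contains entities (`&amp;` becomes `&amp;amp;`), so it should only be run on fully decoded text.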