devongarde / ssc

static site checker (an opinionated HTML nitpicker)
Other
10 stars 2 forks source link

UNICODE #22

Closed devongarde closed 3 years ago

devongarde commented 3 years ago

The current ssc coverage of unicode is flawed, and only works by good fortune. Make sure it works properly, and add tests to cover it.

devongarde commented 3 years ago

Working on it ...

devongarde commented 3 years ago

An HTML file's charset is now checked against that declared. This resolves one part of the unicode issue, but not that of loading files in bizarro formats.

devongarde commented 3 years ago

Given various charsets are now checked, and most is loaded as UTF-8, then I'm going to close this issue. It is true that some less common charsets will still cause problems. If that matters, I'll open additional issues for them.