While we always rewrite all documents in UTF-8 (as recommended by the ZIM specification), we do not update the charset declared in the HTML document headers accordingly.
When present, both the Content-Type and the charset meta should probably be updated to always indicate UTF-8. Most browsers do not care much about these values and have implemented nice fallbacks, so this is not urgent to fix, but it is probably still important in order to produce valid HTML documents inside the ZIM.
E.g.
Should be fixed to
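As a hypothetical illustration of such a fix (not the snippet from the original report), a minimal Python sketch could rewrite both meta forms with a single regular expression. The function name `force_utf8_charset` and the sample HTML strings are assumptions made for this example. A regex can be fooled by unusual markup, so a real implementation would more likely go through an HTML parser.

```python
import re

# Matches a charset declaration inside a <meta> tag, covering both
#   <meta charset="...">                                   (HTML5 form)
#   <meta http-equiv="Content-Type" content="...; charset=...">
# Group 1 is everything up to and including "charset=",
# group 2 is the optional quote that opens the charset value.
META_CHARSET_RE = re.compile(
    r'(<meta\b[^>]*?\bcharset\s*=\s*)(["\']?)[^"\'\s/>;]+\2',
    re.IGNORECASE,
)


def force_utf8_charset(html: str) -> str:
    """Return html with every <meta> charset declaration set to UTF-8.

    Hypothetical helper for illustration only: it assumes the document
    body has already been re-encoded to UTF-8 elsewhere.
    """
    return META_CHARSET_RE.sub(
        lambda m: f"{m.group(1)}{m.group(2)}UTF-8{m.group(2)}",
        html,
    )


if __name__ == "__main__":
    print(force_utf8_charset('<meta charset="iso-8859-1">'))
    print(force_utf8_charset(
        '<meta http-equiv="Content-Type" '
        'content="text/html; charset=ISO-8859-1">'
    ))
```

Documents without any charset declaration are returned unchanged, which matches the "when present" wording above.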