raku-community-modules / URI

Raku realization of URI — Uniform Resource Identifiers handler
Artistic License 2.0
3 stars 14 forks source link

UTF-16 support? #12

Closed tadzik closed 9 years ago

tadzik commented 9 years ago

I got https://github.com/tadzik/Bailador/issues/31 reported for Bailador, is this something that should be fixed on URI's side?

ronaldxs commented 9 years ago

We have gotten confused between UTF-16 and the documented default character set for a web page when none is declared - ISO-8859-1. They happen to have more or less the same encoding for Æ which is C6 for ISO-8859-1 and probably 00C6 for UTF-16. The current URI::Escape has a no_utf8 option which does essentially what you need. You might look at the URI::Escape code and https://en.wikipedia.org/wiki/Latin-1_Supplement_%28Unicode_block%29 for more information. As far as I can tell your requirement is met and this issue should be closed.

tadzik commented 9 years ago

Fair enough, thanks for the reply!