irssi-import / bugs.irssi.org

bugs.irssi.org archive
https://github.com/irssi/irssi/issues
0 stars 0 forks source link

UTF-8 characters in theme files are displayed as question mark (replacement character) #743

Open irssibot opened 14 years ago

irssibot commented 14 years ago

Putting UTF-8 encoded unicode characters (such as £ or ») in any item in a theme file displays each of those characters as a single Unicode Character 'REPLACEMENT CHARACTER' (U+FFFD), i.e. byte sequence 0xEF 0xBF 0xBD in UTF-8.

I don't see any reason why irssi would change them at all... the bytes that go in in the configuration should be the same bytes that go out to the terminal.

The same UTF-8 characters display fine when input directly into irssi, printed from scripts, or received in channel messages.

I have confirmed the same behaviour with another user on freenode/#irssi

irssibot commented 11 years ago

I'm seeing this same issue in 0.8.15 on both Linux and FreeBSD. For example, the » characters in the "aka" theme display as the replacement character despite encoding for both the terminal and the shell being UTF-8. I tried converting the theme file to UTF-8 (it ships encoded ISO-8859), which resulted in the » in the prompt being displayed properly, but the » characters in channel and message windows displayed as part of the theme (eg. as part of join and part notifications) still display as the replacement character.