eafer / rdrview

Firefox Reader View as a command line tool
Apache License 2.0
836 stars 35 forks source link

Umlauts not displayed through rdrview #21

Closed sdsddsd1 closed 2 years ago

sdsddsd1 commented 2 years ago

Umlauts are displayed as fragments when a page is extracted through rdrview. Example: https://www.heise.de/news/Abschaltungen-von-Mining-Farmen-in-China-Gefahr-oder-Chance-fuer-Bitcoin-6160001.html
In my.mailcap is

text/html; /usr/local/bin/lynx --dump %s; copiousoutput; description=HTML Text; nametemplate=%s.html

Snippet in rdrview:

...Kryptowährung, die anfällig für eine...

Snippet through lynx --dump:

...Kryptowährung, die anfällig für eine 51-Prozent...
eafer commented 2 years ago

I can't reproduce. I seem to recall an encoding problem with old versions of lynx, which version are you using? I'm on 2.9.0dev.6.

Also, have you tried other browsers?

sdsddsd1 commented 2 years ago

Apologies, I have missed your response. My lynx version is 2.8.9rel.1. I have to try a later version. I have tested with w3m 0.5. and the umlauts are displayed correctly.

sdsddsd1 commented 2 years ago

Lynx version 2.9.0.dev.6 displays the umlauts in the correct way. So it seems to be a problem with lynx. As mentioned w3m also works correctly. Closing, thx!