DocNow / diffengine

track changes to the news, where news is anything with an RSS feed
MIT License
177 stars 30 forks source link

Encoding autodetection #88

Closed nahuelhds closed 4 years ago

nahuelhds commented 4 years ago

Now if the page has latin1 or ascii chars, they're auto decoded to UTF-8.

This ensures a better version comparison and it avoids tweeting changes when the actual change is the encoding (happens a lot with https://twitter.com/mp_diff). For example this tweet