aaronsw / html2text

Convert HTML to Markdown-formatted text.
http://www.aaronsw.com/2002/html2text/
GNU General Public License v3.0
2.58k stars 410 forks source link

Best management HTML character entity #38

Closed KanorUbu closed 12 years ago

KanorUbu commented 12 years ago

Hi I have modify the program for to take into consideration the accents in HTML character entity. Example : à → a in aarowsw/master à → à in KanorUbu/master

aaronsw commented 12 years ago

Have you tried just turning UNICODE_SNOB on?

KanorUbu commented 12 years ago

oh i have not see this feature. My pull request is useless. Thank you.

aaronsw commented 12 years ago

If you want to turn it on by default, now might be the time (it's been 12 years...).

KanorUbu commented 12 years ago

Good idea, unicode is standard now.