Alir3z4 / html2text

Convert HTML to Markdown-formatted text.
alir3z4.github.io/html2text/
GNU General Public License v3.0
1.85k stars 283 forks source link

TypeError in escape_md #153

Open john-hewitt opened 7 years ago

john-hewitt commented 7 years ago

Version : 2016.9.19

Python Version : 2.7.13

A large number of non-English web pages I'm working with break html2text. Attached is an HTML file that replicates this issue.

import html2text html2text.html2text(open('breaking.txt', 'r').read().decode('utf-8'))

breaking.txt

jdufresne commented 5 years ago

I am not able to reproduce this on the latest master. Perhaps this has been fixed and can be closed.