aaronsw / html2text

Convert HTML to Markdown-formatted text.
http://www.aaronsw.com/2002/html2text/
GNU General Public License v3.0

Allow paragraphs beginning with an emdash (--) to be word-wrapped. #37

Closed inklesspen closed 12 years ago

inklesspen commented 12 years ago

A friend of mine uses html2text to create plain-text versions of his fiction. He recently found a case where a paragraph starting with an emdash wasn't line-wrapped. I've done my best to fix this cleanly, though the fix isn't as elegant as I would have liked. A test is included. (The text in the test file has been replaced with lorem ipsum, since my friend wasn't comfortable with his fiction being included in the html2text repo.)
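To illustrate the kind of check involved, here is a minimal sketch using only the standard-library `textwrap` module. It is not the code from this pull request: the helper names `should_skip_wrap` and `wrap_paragraphs` are hypothetical, and the rule "a single leading `-` or `*` marks a list item, exactly two hyphens mark an emdash" is an assumption about how such a wrapper might decide what to leave alone.

```python
import textwrap


def should_skip_wrap(para: str) -> bool:
    """Hypothetical helper: return True if the paragraph should be left unwrapped."""
    # Assumption: indented text (four spaces or a tab) is a code block.
    if para.startswith("    ") or para.startswith("\t"):
        return True
    stripped = para.lstrip()
    # Exactly two leading hyphens are treated as an emdash, so wrap normally.
    if stripped.startswith("--") and not stripped.startswith("---"):
        return False
    # Assumption: a single leading "-" or "*" marks a list item; leave it alone.
    if stripped.startswith(("-", "*")):
        return True
    return False


def wrap_paragraphs(text: str, width: int = 78) -> str:
    """Wrap each blank-line-separated paragraph unless should_skip_wrap says no."""
    wrapped = []
    for para in text.split("\n\n"):
        if should_skip_wrap(para):
            wrapped.append(para)
        else:
            wrapped.append(textwrap.fill(para, width))
    return "\n\n".join(wrapped)
```

The emdash check has to run before the list-item check. Otherwise a paragraph starting with `--` matches the single-hyphen rule first and is never wrapped, which is the symptom described above.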