aaronsw / html2text

Convert HTML to Markdown-formatted text.
http://www.aaronsw.com/2002/html2text/
GNU General Public License v3.0
2.63k stars 414 forks source link

Line breaks in bold renders incorrect markdown #107

Open dmauro opened 7 years ago

dmauro commented 7 years ago

Bold generally should not be allowed to have line breaks in between them, or have whitespace on the inside, or they'll be treated as literal asterisks. So for example:

<b>Our bold text<br /></b>

Results in:

**Our bold text\n  **

Which will not render in bold with most markdown interpreters.

Instead we should move trailing line breaks outside of the asterisks:

**Our bold text**\n  

and end and re-start bold for if it's not trailing, eg:

<b>Our multiline<br />bold text</b>

to

**Our multiline**\n  **bold text**