Alir3z4 / html2text

Convert HTML to Markdown-formatted text.
alir3z4.github.io/html2text/
GNU General Public License v3.0
1.75k stars 270 forks source link

Html character entities adding additional backslashes #331

Open Joseph-33 opened 4 years ago

Joseph-33 commented 4 years ago

Html2text was observed adding additional backslashes before a hyphen when parsing html strings using html2text.html2text(str)

Example

String Input: BEAK&gt; - Brean Down</a>

Expected: BEAK> - Brean Down

Actual: BEAK> \\- Brean Down

There was a similar case for other other character entities as well: BEAK≤ \\- Brean Down