Alir3z4 / html2text

Convert HTML to Markdown-formatted text.
alir3z4.github.io/html2text/
GNU General Public License v3.0

Line breaks in links even when setting protect_links = true #425

Open seanthegeek opened 3 months ago

seanthegeek commented 3 months ago

html2text version: 2024.2.26
Python version: 3.11.12

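import html2text

# Placeholder input: substitute any HTML that contains a long link.
html = '<p>See <a href="https://example.com/a/long/path">this link</a>.</p>'

h2t = html2text.HTML2Text()
h2t.protect_links = True
markdown = h2t.handle(html)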

According to the documentation, this should protect URLs from line breaks, but it does not: it only wraps each URL in angle brackets, which does not prevent the breaks. Currently the only ways to truly avoid line breaks in links are to use reference-style links or to set body_width to 0, both of which produce ugly Markdown documents. It would be better if protect_links allowed links to extend beyond the body width, as the documentation implies.
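To make the requested behavior concrete, here is a rough sketch of wrapping that lets links run past the body width. It uses only the standard library's textwrap module and is not how html2text wraps text internally. The name wrap_keeping_links and the LINK_RE pattern are made up for illustration. The pattern is a simplification that assumes inline links without parentheses in the URL, and the input is assumed to be a single unwrapped paragraph.

```python
import re
import textwrap

# Simplified pattern for an inline Markdown link such as
# [text](<https://example.com/a-b>). Assumes no ")" inside the URL.
LINK_RE = re.compile(r"\[[^\]]*\]\([^)]*\)")

# Stand-in for spaces inside links; assumed never to occur in real input.
PLACEHOLDER = "\x00"


def wrap_keeping_links(text: str, width: int = 78) -> str:
    """Wrap one paragraph at `width` without breaking inside inline links."""
    # Hide the spaces inside each link so the wrapper sees it as one word.
    masked = LINK_RE.sub(
        lambda m: m.group(0).replace(" ", PLACEHOLDER), text
    )
    wrapped = textwrap.fill(
        masked,
        width=width,
        break_long_words=False,  # let an over-long link exceed the width
        break_on_hyphens=False,  # do not split URLs at hyphens
    )
    # Restore the spaces that were hidden inside links.
    return wrapped.replace(PLACEHOLDER, " ")


if __name__ == "__main__":
    paragraph = (
        "See the documentation at "
        "[a very long link](<https://example.com/some/really-long/path/"
        "that-goes-on/and-on/index.html>) for details on wrapping."
    )
    print(wrap_keeping_links(paragraph, width=40))
```

With this approach the ordinary text still wraps at the chosen width, while a link that is longer than the width stays whole on its own line.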