Closed wking closed 11 years ago
I think a proper fix for this issue would be to restructure the whole output framework to be more line-based (to make it easier to figure out where preceding whitespace comes from, and make it easier to strip trailing whitespace), but that's too big a task for me to commit to at the moment.
html2text has problems when the HTML to parse starts off with:
It works fine with
This problem was acknowledged in #9 https://github.com/aaronsw/html2text/issues/9#issuecomment-8735046
html2text's parsing procedure is a bit opaque to me, so this may not be the cleanest fix, but it does work.