averagehuman / python-html2rest

Convert HTML to reStructuredText
BSD 3-Clause "New" or "Revised" License

Links are broken #4

Open AlmightyOatmeal opened 8 years ago

AlmightyOatmeal commented 8 years ago

All links come out as this without any references:

`Contact us to learn more`_

lourot commented 8 years ago

This project looks abandoned. I now use Pandoc instead:

$ pandoc --from=html --to=rst < README.html > README.rst

Hope this helps.

AlmightyOatmeal commented 8 years ago

@AurelienLourot: Thank you for the suggestion, but that's not the direction I'm looking for.

The goal is to avoid relying on third-party applications like Pandoc, so that the code is significantly more portable.

I also found that Pandoc produced some sloppy reStructuredText formatting, so I ended up writing my own converter that serves my needs and relies on no external binaries. It uses only lxml (I like lxml.objectify much more than the built-in XML DOM utilities, and it's significantly faster).
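
For anyone hitting the same problem: a named reference such as `Contact us to learn more`_ only resolves if the document also contains a matching target line of the form `.. _label: url`. Below is a minimal sketch of a binary-free conversion that emits both. It is not the converter mentioned above and not html2rest's code. It swaps lxml for the standard library's `html.parser` so it runs without extra installs, and the names `LinkToRstParser` and `html_links_to_rst` are hypothetical. It handles only `<a href>` elements and plain text, and assumes link labels are unique and contain no colons or backquotes.

```python
from html.parser import HTMLParser


class LinkToRstParser(HTMLParser):
    """Hypothetical helper: turn <a href> elements into RST named references."""

    def __init__(self):
        super().__init__()
        self.parts = []      # output body fragments, in document order
        self.targets = []    # (label, url) pairs for the target lines
        self._href = None    # href of the <a> currently open, if any
        self._label = []     # text collected inside the open <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._label = []

    def handle_data(self, data):
        # Text inside a link becomes the label; everything else is body text.
        if self._href is not None:
            self._label.append(data)
        else:
            self.parts.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Collapse internal whitespace so the label fits on one line.
            label = " ".join("".join(self._label).split())
            self.parts.append("`%s`_" % label)
            self.targets.append((label, self._href))
            self._href = None


def html_links_to_rst(html):
    """Return RST text with every link followed by a matching target line."""
    parser = LinkToRstParser()
    parser.feed(html)
    parser.close()
    body = "".join(parser.parts).strip()
    if not parser.targets:
        return body
    refs = "\n".join(".. _%s: %s" % (label, url) for label, url in parser.targets)
    return body + "\n\n" + refs


if __name__ == "__main__":
    print(html_links_to_rst(
        '<p><a href="https://example.com/contact">Contact us to learn more</a></p>'
    ))
```

The URL in the usage line is a placeholder. A real converter would also need to escape special characters in labels and disambiguate repeated link text, for example by falling back to anonymous references.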