Open bmix opened 3 years ago
As far as I can tell, no existing tool does what HTML2XML does, and the obvious ways of writing one (e.g. Python BeautifulSoup, HTML Tidy) donʼt actually work right, especially around namespaces.
The version here also isnʼt ideal (Iʼm planning to submit another PR about that in a few minutes), but it would be better than everything else I could find, i.e. nothing.
The original 1.4 distribution contained some example apps that could be used from the command line. The author stated:
The source code is in test-src/nu/validator/htmlparser/tools/, but none of the releases I found on Maven Central has the classes built in. I do have an older JAR on disk from years ago, also named htmlparser-1.4.jar, that had these classes and is thus usable from the CLI.
May I kindly ask you to bring these back, so one can convert HTML into XHTML simply from the command line? Thank you!