s7loves / pesta

Automatically exported from code.google.com/p/pesta

Dependency on java nekohtml #6

Status: Open. GoogleCodeExporter opened this issue 8 years ago

GoogleCodeExporter commented 8 years ago
Pesta still depends on nekohtml, a Java library, for HTML parsing. Native
HTML parsing should be implemented so that this dependency can be dropped.

Original issue reported on code.google.com by seanlinmt on 23 Nov 2008 at 2:25

GoogleCodeExporter commented 8 years ago

Original comment by seanlinmt on 21 Aug 2009 at 6:37

GoogleCodeExporter commented 8 years ago

Original comment by seanlinmt on 21 Aug 2009 at 6:38

GoogleCodeExporter commented 8 years ago

Original comment by seanlinmt on 21 Aug 2009 at 6:40

GoogleCodeExporter commented 8 years ago
There's an SGML reader from Microsoft here:
http://code.msdn.microsoft.com/SgmlReader

It reads HTML, normalizes it, and presents it in well-formed form via an
XmlReader.
I've used it more than a few times to process HTML, and it works reasonably
well.

Original comment by dave.bac...@gmail.com on 21 Jan 2010 at 11:43
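
For readers unfamiliar with the approach suggested above, here is a minimal sketch of the general idea: read loosely written HTML and emit a well-formed XML tree. It is written in Python with only the standard library and does not use SgmlReader or reproduce its API. The names `NormalizingParser`, `html_to_xml`, and `VOID_TAGS` are hypothetical and chosen for illustration, and the recovery rules are deliberately simpler than what a real SGML/HTML reader applies.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET

# HTML elements that never take a closing tag.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class NormalizingParser(HTMLParser):
    """Illustrative only: builds a well-formed tree from sloppy HTML."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._builder = ET.TreeBuilder()
        self._open = []  # names of the elements that are currently open

    def handle_starttag(self, tag, attrs):
        # Valueless attributes (e.g. "disabled") get an empty string value.
        clean = {name: (value if value is not None else "")
                 for name, value in attrs}
        self._builder.start(tag, clean)
        if tag in VOID_TAGS:
            self._builder.end(tag)  # close void elements immediately
        else:
            self._open.append(tag)

    def handle_endtag(self, tag):
        if tag not in self._open:
            return  # stray closing tag: ignore it
        # Close every element left open inside the one being closed.
        while self._open:
            name = self._open.pop()
            self._builder.end(name)
            if name == tag:
                break

    def handle_data(self, data):
        if self._open:  # text outside the root element is dropped
            self._builder.data(data)

    def finish(self):
        self.close()
        while self._open:  # close whatever is still open at end of input
            self._builder.end(self._open.pop())
        return self._builder.close()


def html_to_xml(html):
    """Return a well-formed XML string for the first root element."""
    parser = NormalizingParser()
    parser.feed(html)
    return ET.tostring(parser.finish(), encoding="unicode")


if __name__ == "__main__":
    print(html_to_xml("<div><p>Hello<br><b>world</div>"))
```

Given the input in the last line, the sketch closes the unterminated `p` and `b` elements and self-closes `br`, so the result can be loaded by any XML parser. A native replacement for nekohtml would need far more complete error recovery than this.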

GoogleCodeExporter commented 8 years ago
That looks good. Are you by any chance interested in helping out?

Original comment by seanlinmt on 6 Feb 2010 at 3:04