francisdb / flicklib

Automatically exported from code.google.com/p/flicklib

Look into using an HTML cleaner #22

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
http://www.benmccann.com/dev-blog/java-html-parsing-library-comparison/

Parsing should be easier when using one of those with XPath?
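
To illustrate the idea, here is a minimal sketch of what XPath-based extraction could look like once a cleaner has turned the page into well-formed XML. It uses only the JDK's built-in `javax.xml` APIs. The class name `XPathSketch`, the helper `evaluate`, and the sample markup are all hypothetical and not taken from flicklib, and the cleaning step itself is assumed to have already happened.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

public class XPathSketch {

    // Hypothetical helper: evaluates an XPath expression against markup that is
    // assumed to be well-formed XML already (e.g. the output of an HTML cleaner).
    // Returns the trimmed string value of the first matching node.
    static String evaluate(String cleanedMarkup, String expression) throws Exception {
        DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
        Document doc = builder.parse(new InputSource(new StringReader(cleanedMarkup)));
        XPath xpath = XPathFactory.newInstance().newXPath();
        return xpath.evaluate(expression, doc).trim();
    }

    public static void main(String[] args) throws Exception {
        // Made-up sample page, standing in for cleaned HTML from a movie site.
        String page = "<html><body>"
                + "<div class=\"movie\"><h1>Example Title</h1>"
                + "<span class=\"year\">2010</span></div>"
                + "</body></html>";

        System.out.println(evaluate(page, "//div[@class='movie']/h1"));
        System.out.println(evaluate(page, "//span[@class='year']"));
    }
}
```

Each field becomes a single expression instead of hand-written tree navigation, which is the main appeal. Real pages are rarely well-formed, so this only works after a cleaner has repaired the markup.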

Original issue reported on code.google.com by franci...@gmail.com on 26 Jan 2010 at 6:50

GoogleCodeExporter commented 9 years ago
Probably. Jericho HTML is very powerful, but it's a bit frustrating to write 
the same navigation code again and again. However, with the recently created 
SimpleXPath class, it's a little simpler.

Original comment by gzsom...@gmail.com on 25 Oct 2011 at 1:38

GoogleCodeExporter commented 9 years ago
Another kid on the block: http://jsoup.org/ (available in Maven).
Roughly a Java counterpart of Python's Beautiful Soup.

I think this is way better than Jericho...

Original comment by franci...@gmail.com on 12 Nov 2011 at 4:12