Closed ZJONSSON closed 11 years ago
Python: Thinking out loud it might make sense to look at https://pypi.python.org/pypi/pyquery as an alternative to lxml.html in python scrapers (caveat: haven't tested this). By using CSS3 / jQueryish selectors it's easier to move algorithms between various platforms.
True. I like lxml because it's fast and I (unlike most people) like xpaths.
pallih@gogn.in | http://gogn.in | http://twitter.com/pallih | https://github.com/pallih
PGP: C266 603E 9918 A38B F11D 9F9B E721 347C 45B1 04E9 On Mar 29, 2013 2:15 AM, "Ziggy Jonsson" notifications@github.com wrote:
Python: Thinking out loud it might make sense to look at https://pypi.python.org/pypi/pyquery as an alternative to lxml.html in python scrapers (caveat: haven't tested this). By using CSS3 / jQueryish selectors it's easier to move algorithms between various platforms.
— Reply to this email directly or view it on GitHubhttps://github.com/pallih/fasteignamat-functions/pull/3#issuecomment-15625858 .
Agree the importance of speed. For node.js I pick Cheerio rather than JSDOM for the same reason. http://vimeo.com/31950192?t=7m33s
Experimental Node.js version of the same.