pallih / fasteignamat-functions

Functions to scrape info on landnr and fastanr from skra.is
The Unlicense

Node.js version #3

Closed ZJONSSON closed 11 years ago

ZJONSSON commented 11 years ago

Experimental Node.js version of the same.

ZJONSSON commented 11 years ago

Python: thinking out loud, it might make sense to look at https://pypi.python.org/pypi/pyquery as an alternative to lxml.html in Python scrapers (caveat: I haven't tested this). Using CSS3 / jQuery-style selectors makes it easier to move algorithms between platforms.

pallih commented 11 years ago

True. I like lxml because it's fast and I (unlike most people) like xpaths.

pallih@gogn.in | http://gogn.in | http://twitter.com/pallih | https://github.com/pallih

PGP: C266 603E 9918 A38B F11D 9F9B E721 347C 45B1 04E9

On Mar 29, 2013 2:15 AM, "Ziggy Jonsson" notifications@github.com wrote:

> Python: Thinking out loud it might make sense to look at https://pypi.python.org/pypi/pyquery as an alternative to lxml.html in python scrapers (caveat: haven't tested this). By using CSS3 / jQueryish selectors it's easier to move algorithms between various platforms.


ZJONSSON commented 11 years ago

Agreed on the importance of speed. For Node.js I pick Cheerio rather than JSDOM for the same reason. http://vimeo.com/31950192?t=7m33s