-
Week 3 Goal
-
I'm doing some webcrawling with selenium (java) and phantomjs. I would like to switch to a headless chrome/chromium.
For each crawl i'm using a different proxy-server with a new phantomjs instance (c…
-
Regarding to my question on stackoverflow ( http://stackoverflow.com/questions/30887701/should-i-use-akka-io-apache-spark-mesos-or-storm-for-a-webcrawling-engine ) i found this and the non-grid plugin…
-
A package index should not:
- take control away from the developer or package maintainer
- depend on a single server or provider
currently the idea to do this is:
- the dev/maintainer controls/hosts …
-
私信 一个李富贵
-
Article:
http://blog.iron.io/2012/08/webcrawling-at-scale-with-nokogiri-and.html
Format:
Found
Expected
Reason
The reason is so you can scale the work horizontally without any additional ef…