medialab / sandcrawler

sandcrawler.js - the server-side scraping companion.
http://medialab.github.io/sandcrawler/
GNU Lesser General Public License v3.0
107 stars 12 forks source link

sandcrawler

sandcrawler.js is a node library aiming at providing developers with concise but exhaustive tools to scrape the web.

Disclaimer: this library is an unreleased work in progress.

The library's full documentation is available on github pages.

Contribution

Build Status

Contributions are more than welcome. Feel free to submit any pull request as long as you added unit tests if relevant and passed them all.

To install the development environment, clone your fork and use the following commands:

# Install dependencies
npm install

# Testing
npm test

Authors

sandcrawler.js is being developed by Guillaume Plique @ SciencesPo - médialab.

Logo by Daniele Guido.