Node.js based scraper for ecommerce sites
This repo is no longer being updated. I've made a new generalized site scraper that is not specific to ecommerce sites. Please refer to this new project: new site
$ npm install product-scraper
Simply require the package and initialize with a url and pass a callback function to receive the data.
var scraper = require('product-scraper');
scraper.init('http://www.amazon.com/gp/product/B00X4WHP5E/', function(data){
console.log(data);
});
If you are a store owner and wondering how to make sure your site is being crawled, using the following attributes on your markup:
[itemprop="title"]
[itemprop="brand"]
[itemprop="description"]
[itemprop="image"]
[itemprop="price"]
Currently, this plugin supports a few top ecommerce sites based on alexa ratings. If you want to add any stores, or just have an idea or feature, go ahead and fork this repo and send me a pull request. I'll be happy to take a look when I can and get back to you.
For any and all issues/bugs, please post a description and code sample to reproduce the problem on the issues page.