This is an Amazon Product Scraper built using the scrapy module of Python. It scrapes various product details. By default, it scrapes 5 pages of Mobile Phones from Amazon.
To scrape a different product, follow these instructions:
- Open /amazon_scraper/spiders/amazon_scraper.py.
- Change the urls list at line 16.
- Change the no_of_pages variable to set the number of pages to be scraped.

There are two ways to execute the scraper:
1. Execute the run.sh file directly using the shell:
   sh ./run.sh
2. Or execute the following command:
   scrapy crawl amazon_scraper -o ./data/data.json
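
If you would rather start the crawl from a Python script, the second command can be wrapped with the standard subprocess module. This is only a minimal sketch: build_crawl_command and run_crawl are hypothetical helper names, not part of this project, and it assumes scrapy is on your PATH and the script is run from the project root.

```python
import subprocess


def build_crawl_command(output_path="./data/data.json"):
    """Return the argument list for the crawl command from this README."""
    return ["scrapy", "crawl", "amazon_scraper", "-o", output_path]


def run_crawl(output_path="./data/data.json"):
    """Run the crawl (hypothetical helper); raises CalledProcessError on failure."""
    subprocess.run(build_crawl_command(output_path), check=True)
```

Calling run_crawl() should behave the same as typing the command in a shell.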
It will create a data.json file inside the data folder containing all the scraped data in JSON format. All the images will be saved in the data/img/full folder.
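
To check the result quickly, the generated file can be loaded with Python's json module. The sketch below assumes data.json holds a JSON list of scraped items, which is what Scrapy's JSON export normally writes. load_items is a hypothetical helper name, and the field names inside each item depend on the spider, so none are assumed here.

```python
import json
from pathlib import Path


def load_items(path="./data/data.json"):
    """Load the scraped items; assumes the file holds a JSON list."""
    with Path(path).open(encoding="utf-8") as f:
        items = json.load(f)
    if not isinstance(items, list):
        raise ValueError("expected a JSON list of scraped items")
    return items
```

For example, len(load_items()) gives the number of scraped products, and load_items()[0].keys() shows which fields were collected.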
Already fetched sample data is available in the data folder.
If the data.json file is not generated in the proper format, just delete the data.json file and the img folder, then run the scraper again.
Now you're good to go ;)
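
The cleanup step can also be scripted. A likely cause of a malformed file is that Scrapy's -o option appends to an existing output file, so a second run can leave data.json as invalid JSON. The sketch below is an assumption-laden helper, not part of this project: reset_output is a hypothetical name, and it assumes data.json and the img folder both live under ./data, as described above.

```python
import shutil
from pathlib import Path


def reset_output(data_dir="./data"):
    """Delete data.json and the img folder so the next crawl starts clean."""
    data_dir = Path(data_dir)
    # missing_ok requires Python 3.8+; nothing happens if the file is absent
    (data_dir / "data.json").unlink(missing_ok=True)
    # ignore_errors lets this succeed even if the img folder does not exist
    shutil.rmtree(data_dir / "img", ignore_errors=True)
```

Note that this removes the bundled sample data as well, so keep a copy if you still need it.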

Requirements:
- scrapy
- pillow

License: MIT