basilleaf / marsfromspace

MarsFromSpace.com - image+content scraper and hacked off-the-shelf Wordpress theme
5 stars 0 forks source link

Scraper for the Mars Reconnaissance Orbiter (HiRISE) website, grabs press release images and content from http://hirise.lpl.arizona.edu/releases/all_captions.php and publishes to a Wordpress blog http://www.marsfromspace.com/about/

New: added a django + tastypie api, if anyone wants to grab all the data we scraped: https://github.com/basilleaf/marsfromspace/tree/master/api

Salmoncream is our hacked WP theme.

Posts up to 5 a day, runs on Heroku scheduler:

heroku run python scrape_to_publish.py page_min page_max

ie:

heroku run python scrape_to_publish.py 1 5