Script for downloading, unpacking, and converting online and public (and others to be added soon) datafiles to open format
The goal of this app is to develop a bash interface, modeled after homebrew that will allow anyone free and open access to data that is available on the web. The hope is to make it possible to do three things easily:
$pip install miner
homebrew and R package manager
Please note - miner
is in pre-alpha and these commands are meant to serve only as a preview for future functionality (chances aregood they are currently buggy)
$miner search (or dig) <dataset name> [OPTIONAL: subset name]
$miner search (or dig) uscensus2010
$miner search (or dig) minnesotapublicschools
$miner describe (or assay) <dataset name> [OPTIONAL: subset name]
$miner describe (or assay) uscensus2010
$miner describe (or assay) uscensus2010 nd
#miner install (or extract) <dataset name> [OPTIONAL: subset name]
$miner install (or extract) uscensus2010
$miner install (or extract) uscensus2010 mn
Testing: $nosetests
What is the difference between miner
and dat? We don't miner
as a competitor to dat. Rather, miner
is a parallel and complementary project. Here are what we see as differences:
miner
focuses on using a formula (map) to get raw data files straight from original sources, download, and process them. dat is focused on building collaborative and version controlled datasets.miner
is in early development, it will be fully operational quickly and aims to be a very small application. dat aims to be a much more robust and comprehensive data collaboration tool. miner
makes it possible to pull raw data regularly and note when data is changed (sometimes for good and perhaps sometimes for less good reasons.miner
might be used as one tool to easily dump into dat. It should be one tool in an open data toolkit.One way to look at it is that miner
exists given today's non-standards-based, mixed license, individually/organizationally hosted dataset world. dat could be seen as the forerunner of the open knowledge / open data world.