DataViva / dataviva-etl

Extract / Transform / Load Scripts for databases used in Dataviva Project
24 stars 10 forks source link

DataViva ETL

Extract / Transform / Load Scripts for databases used in Dataviva Project.

The sources of Dataviva can be found at Dataviva GitHub.

How it Works

Databases are released by Brazil government as raw data and there is some analysis to extract the information needed for the project.

For each database we have a Wiki documentation that explain all the ETL process.

Setup

  1. You'll need libmagic for this project:

Mac installation

brew install libmagic

Ubuntu (BSD Linux) installation

apt-get install libmagic
  1. Install requirements
pip install -r requirements.txt

See the Wiki for more information.