coalitionforopendataeducation / open-data-etl-tutorial

A tutorial using Pentaho's data integration tool (Kettle) to setup automated Extract-Transform-Load (ETL) jobs for an open data portal.
Other
0 stars 0 forks source link

Open Data ETL Tutorial

Instructions

Extract Data

Host Name: mysql.tomschenkjr.net Database Name: etl_test_database User Name: etl_tutorial Password: ak9gos8dild9orc

Next, extract data from database, renaming the columns into a human-readable form.

SELECT statedata.St AS State, 
       HSGrad AS HighSchoolGrad, 
       Lon AS Longitude, 
       Lat AS Latitude
FROM statedata, statecenter

Installing MySQL Driver

If the test fails, install the MySQL driver. Move the .jar file to $PENTAHO_HOME/libext/JDBC

Prepare Socrata environment

Visit https://communities.socrata.com/catalog/chicago-etl/ and create an account.