jamielynethorpe / ETL-project

Extract, Transform and Load Databases

ETL-project

We used three datasets from the public platform Kaggle, which led us to the Gun Violence Archive website. The data in the three files included the following information:

The fields of interest include the following:

We used the following sources for our datasets:

https://www.kaggle.com/gunviolencearchive/gun-violence-database

https://www.gunviolencearchive.org/reports

https://www.gunviolencearchive.org/mass-shooting

https://www.gunviolencearchive.org/accidental-deaths

Transformation

To transform the public data for use in our study, we performed the following:

Load

The last step was to load our final output into a database. Using pgAdmin, we created a PostgreSQL database and a table matching the columns of the final pandas DataFrame, and used it to store our cleaned original datasets. We then reconnected to the database and generated additional tables for the data frames.
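As a rough illustration of this load step, the sketch below writes a small data frame to a database table and reads it back to confirm the row count. It is not the project's actual code. The `load_frame` helper, the `incidents` table, and the column names are hypothetical, and an in-memory SQLite connection stands in for Postgres so the example runs without a server. With Postgres, a SQLAlchemy engine would typically be passed to `to_sql` in place of the SQLite connection.

```python
import sqlite3

import pandas as pd


def load_frame(df, conn, table_name):
    """Write df to table_name, replacing any existing table,
    and return the number of rows found in the table afterwards."""
    # index=False keeps the DataFrame index out of the table columns
    df.to_sql(table_name, conn, if_exists="replace", index=False)
    # table_name is interpolated directly; acceptable here only because
    # it is a fixed, trusted name in this illustration
    check = pd.read_sql_query(f"SELECT COUNT(*) AS n FROM {table_name}", conn)
    return int(check["n"].iloc[0])


# Hypothetical sample rows; the real datasets have their own columns
incidents = pd.DataFrame(
    {
        "incident_id": [1, 2, 3],
        "state": ["Ohio", "Texas", "Utah"],
        "n_killed": [0, 2, 1],
    }
)

# Assumption: SQLite in memory as a stand-in for the Postgres database
conn = sqlite3.connect(":memory:")
rows_loaded = load_frame(incidents, conn, "incidents")

# Read the table back to check that columns and values survived the load
stored = pd.read_sql_query("SELECT * FROM incidents", conn)
conn.close()

print(rows_loaded)
```

Reading the table back after writing is a cheap way to confirm that the table columns match the data frame before generating further tables.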

Summary

There were some limitations to our findings due to the data available. However, we were able to address the hypothesis question from our initial project proposal, which is listed in the ETL Project Final Write UP.