Adalab / proyecto-da-promo-b-modulo-3-team-1


README TEAM 1: DATA ANALYST PROJECT

Work on a dataset of ABC Corporation employees.

Data Analysis Project


This project carries out a multi-phase data analysis process. Each phase is explained below.

INDEX

INTRODUCTION

Our mission is to identify the key factors that influence job satisfaction and, ultimately, employee retention. To this end, we have carried out a data analysis process that includes exploratory data analysis (EDA), data transformation, A/B testing, visualizations, the creation of a MySQL database, and an ETL process.

FILES

Files required for project review:

REQUIREMENTS

Make sure you have the following libraries installed in your Python environment:

  1. pandas
  2. numpy
  3. matplotlib
  4. seaborn
  5. scikit-learn
  6. mysql-connector-python
  7. scipy (scipy.stats, including chi2_contingency)

If any of these libraries is missing, you can install it with pip install followed by the package name.
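To illustrate what chi2_contingency is used for in an A/B comparison, here is a minimal sketch. The counts are hypothetical and are not taken from the ABC Corporation dataset; the 0.05 significance level is also an assumption.

```python
from scipy.stats import chi2_contingency

# Hypothetical contingency table (NOT real project data):
# rows are the two groups being compared, columns are outcomes.
#            left  stayed
observed = [[30, 70],   # group A
            [15, 85]]   # group B

# chi2_contingency returns the test statistic, the p-value,
# the degrees of freedom and the expected frequencies.
chi2, p_value, dof, expected = chi2_contingency(observed)

alpha = 0.05  # assumed significance level
print(f"chi2={chi2:.2f}, p={p_value:.4f}, dof={dof}")
if p_value < alpha:
    print("The difference between groups is statistically significant.")
else:
    print("No statistically significant difference was detected.")
```

If this runs without an ImportError, scipy is installed correctly.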

THE PROCESS

Built with

Technologies used in the project:

First phase: deep data exploration

Importing libraries and loading data:

Importing pandas and using it to load CSV files into DataFrames.
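A minimal sketch of this step follows. The column names and values are hypothetical placeholders, and an in-memory string stands in for the real CSV file so the example runs on its own.

```python
import io

import pandas as pd

# Hypothetical sample standing in for the project's CSV file;
# the column names are illustrative, not taken from the real dataset.
csv_text = """employee_id,age,job_satisfaction,attrition
1,34,3,No
2,45,1,Yes
3,29,4,No
"""

# pd.read_csv accepts a file path or any file-like object, so with a
# real file the argument would simply be the path to the CSV.
df = pd.read_csv(io.StringIO(csv_text))

# Typical first checks: size, column types and missing values.
print(df.shape)
print(df.dtypes)
print(df.isnull().sum())
```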


General exploration

Second phase: data transformation

Third phase: visualization

Study of six real-world questions about the data and their representation through graphs.
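As an illustration of how one such question could be turned into a graph, the sketch below compares average job satisfaction between employees who left and those who stayed. The question, the column names and the data are assumptions made for the example, not the six questions actually studied.

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend, so no display is needed

import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical data (NOT the real dataset).
df = pd.DataFrame({
    "attrition": ["No", "No", "Yes", "No", "Yes", "Yes"],
    "job_satisfaction": [4, 3, 1, 4, 2, 1],
})

# Average satisfaction score for each attrition group.
mean_satisfaction = df.groupby("attrition")["job_satisfaction"].mean()

fig, ax = plt.subplots(figsize=(5, 4))
ax.bar(list(mean_satisfaction.index), list(mean_satisfaction.values))
ax.set_xlabel("Attrition")
ax.set_ylabel("Mean job satisfaction")
ax.set_title("Job satisfaction by attrition (hypothetical data)")
fig.tight_layout()

# fig.savefig("satisfaction_by_attrition.png") would write the chart to disk.
```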

Fourth phase: Database

Creation of a database from the cleaned DataFrame in MySQL Workbench, defining the tables and their corresponding relationships and constraints. Lastly, creation of the database diagram.
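The idea of tables linked by relationships and constraints can be sketched as follows. This example deliberately uses Python's built-in sqlite3 module instead of MySQL so that it runs without a server, and the table and column names are hypothetical rather than the project's actual schema.

```python
import sqlite3

# In-memory SQLite database used as a stand-in for MySQL (assumption).
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled

# Hypothetical two-table schema with a primary key, a uniqueness
# constraint, a check constraint and a foreign key relationship.
conn.executescript("""
CREATE TABLE departments (
    department_id INTEGER PRIMARY KEY,
    name TEXT NOT NULL UNIQUE
);
CREATE TABLE employees (
    employee_id INTEGER PRIMARY KEY,
    age INTEGER CHECK (age >= 16),
    department_id INTEGER NOT NULL,
    FOREIGN KEY (department_id) REFERENCES departments (department_id)
);
""")

conn.execute("INSERT INTO departments VALUES (1, 'Sales')")
conn.execute("INSERT INTO employees VALUES (1, 34, 1)")

# An employee pointing at a department that does not exist is rejected.
try:
    conn.execute("INSERT INTO employees VALUES (2, 41, 99)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

print(rejected)
```

In MySQL the same constraints would be declared in the CREATE TABLE statements or through the MySQL Workbench table editor.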

Fifth phase: ETL

Data extraction, transformation and loading (ETL): automation of the data transformation and of the insertion into the database, so that information is updated and inserted in a consistent manner.
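A minimal sketch of such a pipeline is shown below. It is not the project's actual code: SQLite (from the standard library) replaces MySQL so the example runs without a server, and the sample data, the table name and the extract, transform and load helper functions are all hypothetical.

```python
import csv
import io
import sqlite3

# Hypothetical raw extract with a missing value and inconsistent casing.
RAW_CSV = """employee_id,age,job_satisfaction,attrition
1,34,3,No
2,45,,Yes
3,29,4,no
"""


def extract(text):
    """Read CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Cast types, normalise the attrition flag and map blanks to None."""
    records = []
    for row in rows:
        satisfaction = row["job_satisfaction"]
        records.append((
            int(row["employee_id"]),
            int(row["age"]),
            int(satisfaction) if satisfaction else None,
            int(row["attrition"].strip().lower() == "yes"),
        ))
    return records


def load(conn, records):
    """Insert or replace records so re-running does not duplicate rows."""
    conn.execute(
        """CREATE TABLE IF NOT EXISTS employees (
               employee_id INTEGER PRIMARY KEY,
               age INTEGER NOT NULL,
               job_satisfaction INTEGER,
               attrition INTEGER NOT NULL
           )"""
    )
    conn.executemany(
        "INSERT OR REPLACE INTO employees VALUES (?, ?, ?, ?)", records
    )
    conn.commit()


conn = sqlite3.connect(":memory:")
load(conn, transform(extract(RAW_CSV)))
load(conn, transform(extract(RAW_CSV)))  # second run updates rows in place

row_count = conn.execute("SELECT COUNT(*) FROM employees").fetchone()[0]
print(row_count)
```

Keying the load step on the primary key is what keeps repeated runs consistent. With mysql-connector-python the structure would be the same, but the placeholders are %s instead of ?, and the upsert would typically be written as INSERT ... ON DUPLICATE KEY UPDATE.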

Author

Made with 💜 by [Belén V N (https://github.com/BelenVN), Gloria L C (https://github.com/GloriaLopezChinarro), Viviana V R (https://github.com/Viviana1988) and Cristina R H (https://github.com/cristinarull14)]

ENJOY IT 🤩