3MTT-PET / Proficiency-Tool

The proficiency evaluation tool for the largest tech talent accelerator in the world
MIT License
7 stars 29 forks source link

Data Science Track: Observation on the Curriculum #5

Open Ayokolapo opened 3 months ago

Ayokolapo commented 3 months ago

The data science lifecycle is missing. This is a critical element of the training as it will help participants put topics of the training in perspective. It can also guide the flow through which this training should follow. See sample life cycle below:

image
Ayokolapo commented 3 months ago

Still on the curriculum, the "Data modelling" section should be called "Predictive modelling". Please see Data science lifecycle image above for context. Data modelling is a word used by data engineers and data analysts and it means something totally different.

Ayokolapo commented 3 months ago

PowerBI/Tableau should be replaced with Seaborn, Matplotlib and (or) Plotly (all python libraries.) These are the visualization tools used by data scientists for Exploratory Data Analysis (EDA) and other forms of analysis. PowerBI/Tableau is primarily for Data analysts.

Aggregation and manipulation in preparation for visualization should be done in Pandas or any other Python library for such.

EmmanuelTheCoder commented 3 months ago

Thanks for raising this important observation. @3MTT Please add the curriculum tag to it.