-
- [x] Correlation checking to identify most relevant features
- [x] Boolean columns visualization and interpretation
- [x] Final step: Conclusions on data quality and possible data processing
- […
-
We need to perform (at least a little) EDA so we can train a decently performant model.
-
### Have you completed your first issue?
- [X] I have completed my first issue
### Guidelines
- [X] I have read the guidelines
- [X] I have the link to my latest merged PR
### Latest Merged PR Lin…
-
- clean up the code
- add more comments
-
Related to #10 . For all patients identified via PLUNC from 2016 forward, provide patient counts by the following variables:
1. Risk Category: SR – Standard risk vs HR – High risk
2. Physician
3. …
-
-
Milestone 2: Selection of data set(s), exploratory data analysis (soft deadline 28.10, firm deadline 04.11)
-
import seaborn as sns
import pandas as pd
def perform_eda(data):
sns.pairplot(data)
plt.show()
-
A comprehensive guide for conducting EDA on sports-related datasets. It aims to equip users with the skills necessary to uncover insights and patterns from raw data, leveraging various statistical and…
-
The team would like to understand damage in the country from 2022-2024. They want to conduct an exploratory analysis to identify locations of illicit mining and change in construction activity in Myan…