recodehive / Stackoverflow-Analysis

Stack overflow is a professional community for developers. This repo analysis 3 years of developer Survey done by Stackoverflow and do visualization and predict the salary of Data Scientist in future.
https://stackoverflow-analysis.streamlit.app/
MIT License
229 stars 118 forks source link

[Documentation Update]: Adding importance of data cleaning and tools used for it in the readme #380

Open shravya312 opened 1 week ago

shravya312 commented 1 week ago

Is there an existing issue for this?

Issue Description

Data cleaning is one of the important part of the machine learning project

Suggested Change

During surveys there are possibillity of duplicate data.Its a waste of resources to make predictions on the same data. Sometimes Details told to fill in the survey will be left blank thats call NULL. these should be treated before starting predicted So people should be aware of data cleaning

Rationale

It help people to understand why data cleaning's important and gain insights on how to do it

Urgency

High

Record

github-actions[bot] commented 1 week ago

Thank you for creating this issue! 🎉 We'll look into it as soon as possible. In the meantime, please make sure to provide all the necessary details and context. If you have any questions or additional information, feel free to add them here. Your contributions are highly appreciated! 😊

You can also check our CONTRIBUTING.md for guidelines on contributing to this project.

github-actions[bot] commented 1 week ago

Hi there! Thanks for opening this issue. We appreciate your contribution to this open-source project. We aim to respond or assign your issue as soon as possible.

sanjay-kv commented 6 days ago

i think the current readmen is pretty long enough, i dont think this is required. feel free to raise more issues.

shravya312 commented 6 days ago

Okay sure

Ayesha480 commented 7 hours ago

please assign the lable i want to work on it