abhisheks008 / ML-Crate

ML-Crate stands as the ultimate hub for a multitude of exciting ML projects, serving as the go-to resource haven for passionate and dedicated ML enthusiasts!🌟💫 Devfolio URL, https://devfolio.co/projects/mlcrate-98f9
https://quine.sh/repo/abhisheks008-ML-Crate-409463050
MIT License
180 stars 215 forks source link

Automobiles Sales Data Analysis #576

Closed mariam7084 closed 3 weeks ago

mariam7084 commented 4 months ago

ML-Crate Repository (Proposing new issue)

:red_circle: Project Title : Automobiles Sales Data Analysis :red_circle: Aim : Perform EDA :red_circle: Dataset : https://www.kaggle.com/datasets/ddosad/auto-sales-data :red_circle: Approach : Try to use 3-4 algorithms to implement the models and compare all the algorithms to find out the best fitted algorithm for the model by checking the accuracy scores. Also do not forget to do a exploratory data analysis before creating any model.


📍 Follow the Guidelines to Contribute in the Project :


:red_circle::yellow_circle: Points to Note :


:white_check_mark: To be Mentioned while taking the issue :


Happy Contributing 🚀

All the best. Enjoy your open source journey ahead. 😎

shivansh-2003 commented 1 month ago

Can You Please Assign this issue under SSOC. 2024 Season 3 Shivansh Mahajan Github:- https://github.com/shivansh-2003 Participation ID:- NA I will do EDA of the data set by various statistical methods like IQR , Study Distribution OF Feature and Correlation Matrix. I would train the data in Various ML model to. arrive to the better Accuracy score. I would then feed the data for Feature engineering and then train it with different machine learning models KNN , Random forest , Decision Tree , SVM and Bossting Algorithms . I am well versed with Machine Learning you can check out my linkedin :-https://www.linkedin.com/in/shivansh-mahajan-13227824a/ and Git repository . can u assign me with this issue @abhisheks008 Participation Role:- SSOC Season 3

abhisheks008 commented 1 month ago

Contributions will start from June 1, 2024. Till then please have some patience.

DarkRaiderCB commented 1 month ago

Full name : Sanyog Mishra GitHub Profile Link : https://github.com/DarkRaiderCB Participant ID: NA Approach for this Project : Will perform EDA on dataset provided using techniques like Data cleaning, categorial analysis, finding any outliers, visualisation (correlation matrix, heat maps and more), summary stats., etc. Would utilise feature engineering and use ML algorithms like Linear Regression, Decision Tree, XGBoost, KNN, etc. and will find the best performing model. Tools to be used: Pandas, Numpy, Matplotlib, Scikit Learn, XGBoost. Resume: https://drive.google.com/file/d/1sDVtq69GJd83t4H1-EOlvHEyQc2oat1k/view?usp=drive_link Participant Role: Contributor SSOC Season 3

aryan0931 commented 1 month ago

Name: Aryan Yadav github: https://github.com/aryan0931 Participant id:NA Approach for this project: I will perform Exploratory Data Analysis (EDA) on the provided dataset using techniques such as data cleaning, categorical analysis, outlier detection, and visualization (including correlation matrices, heat maps, and more). This process will involve generating summary statistics and conducting feature engineering to prepare the data for machine learning. For the machine learning analysis, I will utilize algorithms like Linear Regression, Decision Trees, XGBoost, and K-Nearest Neighbors (KNN) to identify the best-performing model. The tools and libraries used for this analysis will include Pandas, NumPy, Matplotlib, Scikit-Learn, and XGBoost. Participant role: SSOC

abhisheks008 commented 1 month ago

Full name : Sanyog Mishra GitHub Profile Link : https://github.com/DarkRaiderCB Participant ID: NA Approach for this Project : Will perform EDA on dataset provided using techniques like Data cleaning, categorial analysis, finding any outliers, visualisation (correlation matrix, heat maps and more), summary stats., etc. Would utilise feature engineering and use ML algorithms like Linear Regression, Decision Tree, XGBoost, KNN, etc. and will find the best performing model. Tools to be used: Pandas, Numpy, Matplotlib, Scikit Learn, XGBoost. Resume: https://drive.google.com/file/d/1sDVtq69GJd83t4H1-EOlvHEyQc2oat1k/view?usp=drive_link Participant Role: Contributor SSOC Season 3

Implement 5-6 models for this project and compare them based on their accuracy scores.

Assigning this issue to you @DarkRaiderCB

DarkRaiderCB commented 1 month ago

Full name : Sanyog Mishra GitHub Profile Link : https://github.com/DarkRaiderCB Participant ID: NA Approach for this Project : Will perform EDA on dataset provided using techniques like Data cleaning, categorial analysis, finding any outliers, visualisation (correlation matrix, heat maps and more), summary stats., etc. Would utilise feature engineering and use ML algorithms like Linear Regression, Decision Tree, XGBoost, KNN, etc. and will find the best performing model. Tools to be used: Pandas, Numpy, Matplotlib, Scikit Learn, XGBoost. Resume: https://drive.google.com/file/d/1sDVtq69GJd83t4H1-EOlvHEyQc2oat1k/view?usp=drive_link Participant Role: Contributor SSOC Season 3

Implement 5-6 models for this project and compare them based on their accuracy scores.

Assigning this issue to you @DarkRaiderCB

Thanks sir!! @abhisheks008

github-actions[bot] commented 3 weeks ago

Hello @DarkRaiderCB! Your issue #576 has been closed. Thank you for your contribution!