Closed mariam7084 closed 3 weeks ago
Can You Please Assign this issue under SSOC. 2024 Season 3 Shivansh Mahajan Github:- https://github.com/shivansh-2003 Participation ID:- NA I will do EDA of the data set by various statistical methods like IQR , Study Distribution OF Feature and Correlation Matrix. I would train the data in Various ML model to. arrive to the better Accuracy score. I would then feed the data for Feature engineering and then train it with different machine learning models KNN , Random forest , Decision Tree , SVM and Bossting Algorithms . I am well versed with Machine Learning you can check out my linkedin :-https://www.linkedin.com/in/shivansh-mahajan-13227824a/ and Git repository . can u assign me with this issue @abhisheks008 Participation Role:- SSOC Season 3
Contributions will start from June 1, 2024. Till then please have some patience.
Full name : Sanyog Mishra GitHub Profile Link : https://github.com/DarkRaiderCB Participant ID: NA Approach for this Project : Will perform EDA on dataset provided using techniques like Data cleaning, categorial analysis, finding any outliers, visualisation (correlation matrix, heat maps and more), summary stats., etc. Would utilise feature engineering and use ML algorithms like Linear Regression, Decision Tree, XGBoost, KNN, etc. and will find the best performing model. Tools to be used: Pandas, Numpy, Matplotlib, Scikit Learn, XGBoost. Resume: https://drive.google.com/file/d/1sDVtq69GJd83t4H1-EOlvHEyQc2oat1k/view?usp=drive_link Participant Role: Contributor SSOC Season 3
Name: Aryan Yadav github: https://github.com/aryan0931 Participant id:NA Approach for this project: I will perform Exploratory Data Analysis (EDA) on the provided dataset using techniques such as data cleaning, categorical analysis, outlier detection, and visualization (including correlation matrices, heat maps, and more). This process will involve generating summary statistics and conducting feature engineering to prepare the data for machine learning. For the machine learning analysis, I will utilize algorithms like Linear Regression, Decision Trees, XGBoost, and K-Nearest Neighbors (KNN) to identify the best-performing model. The tools and libraries used for this analysis will include Pandas, NumPy, Matplotlib, Scikit-Learn, and XGBoost. Participant role: SSOC
Full name : Sanyog Mishra GitHub Profile Link : https://github.com/DarkRaiderCB Participant ID: NA Approach for this Project : Will perform EDA on dataset provided using techniques like Data cleaning, categorial analysis, finding any outliers, visualisation (correlation matrix, heat maps and more), summary stats., etc. Would utilise feature engineering and use ML algorithms like Linear Regression, Decision Tree, XGBoost, KNN, etc. and will find the best performing model. Tools to be used: Pandas, Numpy, Matplotlib, Scikit Learn, XGBoost. Resume: https://drive.google.com/file/d/1sDVtq69GJd83t4H1-EOlvHEyQc2oat1k/view?usp=drive_link Participant Role: Contributor SSOC Season 3
Implement 5-6 models for this project and compare them based on their accuracy scores.
Assigning this issue to you @DarkRaiderCB
Full name : Sanyog Mishra GitHub Profile Link : https://github.com/DarkRaiderCB Participant ID: NA Approach for this Project : Will perform EDA on dataset provided using techniques like Data cleaning, categorial analysis, finding any outliers, visualisation (correlation matrix, heat maps and more), summary stats., etc. Would utilise feature engineering and use ML algorithms like Linear Regression, Decision Tree, XGBoost, KNN, etc. and will find the best performing model. Tools to be used: Pandas, Numpy, Matplotlib, Scikit Learn, XGBoost. Resume: https://drive.google.com/file/d/1sDVtq69GJd83t4H1-EOlvHEyQc2oat1k/view?usp=drive_link Participant Role: Contributor SSOC Season 3
Implement 5-6 models for this project and compare them based on their accuracy scores.
Assigning this issue to you @DarkRaiderCB
Thanks sir!! @abhisheks008
Hello @DarkRaiderCB! Your issue #576 has been closed. Thank you for your contribution!
ML-Crate Repository (Proposing new issue)
:red_circle: Project Title : Automobiles Sales Data Analysis :red_circle: Aim : Perform EDA :red_circle: Dataset : https://www.kaggle.com/datasets/ddosad/auto-sales-data :red_circle: Approach : Try to use 3-4 algorithms to implement the models and compare all the algorithms to find out the best fitted algorithm for the model by checking the accuracy scores. Also do not forget to do a exploratory data analysis before creating any model.
📍 Follow the Guidelines to Contribute in the Project :
requirements.txt
- This file will contain the required packages/libraries to run the project in other machines.Model
folder, theREADME.md
file must be filled up properly, with proper visualizations and conclusions.:red_circle::yellow_circle: Points to Note :
:white_check_mark: To be Mentioned while taking the issue :
Happy Contributing 🚀
All the best. Enjoy your open source journey ahead. 😎