abhisheks008 / ML-Crate

ML-Crate stands as the ultimate hub for a multitude of exciting ML projects, serving as the go-to resource haven for passionate and dedicated ML enthusiasts!πŸŒŸπŸ’« Devfolio URL, https://devfolio.co/projects/mlcrate-98f9
https://quine.sh/repo/abhisheks008-ML-Crate-409463050
MIT License
180 stars 215 forks source link

Most Sold Books in the World Analysis #508

Closed abhisheks008 closed 5 months ago

abhisheks008 commented 5 months ago

ML-Crate Repository (Proposing new issue)

:red_circle: Project Title : Most Sold Books in the World Analysis :red_circle: Aim : The aim of this project is to analyze the most sold books in the world using the given dataset. :red_circle: Dataset : https://www.kaggle.com/datasets/rafsunahmad/most-sold-books-in-the-world :red_circle: Approach : Try to use 3-4 algorithms to implement the models and compare all the algorithms to find out the best fitted algorithm for the model by checking the accuracy scores. Also do not forget to do a exploratory data analysis before creating any model.


πŸ“ Follow the Guidelines to Contribute in the Project :


:red_circle::yellow_circle: Points to Note :


:white_check_mark: To be Mentioned while taking the issue :


Happy Contributing πŸš€

All the best. Enjoy your open source journey ahead. 😎

mariam7084 commented 5 months ago

Hi!! @abhisheks008 Please assign this issue to me.

Full Name : Mariam Github Profile Link : https://github.com/mariam7084 Participant ID : NA Approach for this project : I will perform the EDA of the dataset and visualize the results accordingly. What is your participant role? JWoC 2024

abhisheks008 commented 5 months ago

After performing the EDA implement the following models for this project,

  1. Random forest
  2. Decision tree
  3. Logistic regression
  4. Lasso
  5. Ridge
  6. Gradient boosting
  7. XgBoost
  8. MLP

Check the accuracy scores of the deployed models and find out the best one based on the best accuracy score.

Issue assigned to you @mariam7084

mariam7084 commented 5 months ago

@abhisheks008 the dataset contains the following columns: Books title Author name Language Publish date Sales Genre This dataset can't be used to deploy machine learning models.

abhisheks008 commented 5 months ago

Then go for extensive visualization and data analysis, make the conclusions according to that. Just like an analysis project.