UBC-MDS / data-analysis-review-2023

0 stars 0 forks source link

Submission: Group_7: Bank Marketing Prediction #13

Open gtmx23 opened 7 months ago

gtmx23 commented 7 months ago

Submitting authors: @gtmx23 @riyaeliza123 @Owl64901 @charlesxch

Repository: https://github.com/UBC-MDS/Group_7_Project.git Report link: https://ubc-mds.github.io/Group_7_Project/bank_marketing_prediction.html Abstract/executive summary: In this project, we aimed to use customer information from a phone-call based direct marketing campaign of a Portugese banking institution to predict whether customers would subscribe to the product offered, a term deposit. We applied several classification based models (k-NN, SVM, logistic regression and random forest) to our dataset to find the model which best fit our data, eventually settling on the random forest model, which performed the best among all the models tested, with an F-beta score with beta = 5 of 0.82, and an accuracy of 0.677 on the test data.

While this was the best performing model out of the models tested, its accuracy still left much to be desired. This indicates that perhaps more data is needed to accurately predict whether customers would subscribe to the term deposit. Future studies may also consider using more features, a different set of features which might be more relevant to whether customers will subscribe, or utilising feature engineering to obtain features which might be more useful in helping to predict whether customers would subscribe to the service.

Editor: @ttimbers Reviewer: Scout McKee, Rafe Chang, Koray Tecimer, Hongyang Zhang

scout-mckee commented 7 months ago

Data analysis review checklist

Reviewer: @scout-mckee

Conflict of interest

Code of Conduct

General checks

Documentation

Code quality

Reproducibility

Analysis report

Estimated hours spent reviewing: 1 hour

Review Comments:

Please provide more detailed feedback here on what was done particularly well, and what could be improved. It is especially important to elaborate on items that you were not able to check off in the list above. This is a great project! Here is some feedback:

Attribution

This was derived from the JOSE review checklist and the ROpenSci review checklist.

korayt commented 7 months ago

Data analysis review checklist

Reviewer: Koray Tecimer - @korayt

Conflict of interest

Code of Conduct

General checks

Documentation

Code quality

Reproducibility

Analysis report

Estimated hours spent reviewing: 1

Review Comments:

Attribution

This was derived from the JOSE review checklist and the ROpenSci review checklist.

alexzhang0825 commented 7 months ago

title: "Peer Review" output: pdf_document date: "2023-12-04"

Data analysis review checklist

Reviewer:

Conflict of interest

Code of Conduct

General checks

Documentation

Code quality

Reproducibility

Analysis report

Estimated hours spent reviewing: 1

Review Comments:

  1. It is good to have a candidate of models to compete against each other. Also the presentation of result is very detailed and comprehensive with different plots showing different models.

  2. Good job noting and handling the class imbalance.

  3. There is only the creative commons license to cover the fair use of the report. You need another one to cover that of the source code (MIT license).

  4. The instruction in your repository, specifically the console commands contain a $ sign at the start of each command. If someone were to copy it using the button into the console it would throw an error. Remove it for a quality of life improvement.

Attribution

This was derived from the JOSE review checklist and the ROpenSci review checklist.

rafecchang commented 7 months ago

Data analysis review checklist

Reviewer: @rafecchang

Conflict of interest

Code of Conduct

General checks

Documentation

Code quality

Reproducibility

Analysis report

Estimated hours spent reviewing: 1.5

Review Comments:

Please provide more detailed feedback here on what was done particularly well, and what could be improved. It is especially important to elaborate on items that you were not able to check off in the list above.

This is really well written and easy to follow along!

Attribution

This was derived from the JOSE review checklist and the ROpenSci review checklist.