UBC-MDS / data-analysis-review-2023

0 stars 0 forks source link

Submission: Group 0: Breast Cancer Predictor #1

Open ttimbers opened 7 months ago

ttimbers commented 7 months ago

Submitting authors: @ttimbers @leem44 @joelostblom

Repository: https://github.com/ttimbers/breast_cancer_predictor_py Report link: https://ttimbers.github.io/breast_cancer_predictor_py/ Abstract/executive summary: Here we attempt to build a classification model using the k-nearest neighbours algorithm which can use breast cancer tumour image measurements to predict whether a newly discovered breast cancer tumour is benign (i.e., is not harmful and does not require treatment) or malignant (i.e., is harmful and requires treatment intervention). Our final classifier performed fairly well on an unseen test data set, with the F2 score, where beta = 2, of 0.96 and an overall accuracy calculated to be 0.96. On the 171 test data cases, it correctly predicted 168. However it incorrectly predicted 3 cases, and importantly these cases were false negatives; predicting that a tumour is benign when in fact it is malignant. These kind of incorrect predictions could have a severly negative impact on a patients health outcome, thus we recommend continuing study to improve this prediction model before it is put into production in the clinic.

Editor: @ttimbers Reviewer: @kvarada @quan3010 @chendaniely