Open okothchristopher opened 1 year ago
I have sent you an email with some use cases to help you spruce up your final report. These center on:
The uses of being able to predict the probability of having a bank account from demographic data.
To summarize:
You need sections in your notebook that address the following:
Why LabelEncoder alone, and not a combination of LabelEncoder and OneHotEncoder, each where it is appropriate? Look at the articles below for more:
What is the rationale behind the correlation plot, and what insights have you gathered from it?
Class imbalance treatment. You mention that "In this case i decided to do oversampling, as it gives more on model performance as opposed to downsampling." This is not a good enough reason to opt for oversampling. Please recheck the class imbalance techniques, each of which has its own advantages and disadvantages, then choose the most appropriate one, e.g. SMOTE.
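To make the SMOTE suggestion concrete, here is a minimal NumPy sketch of the idea behind it: new minority rows are interpolated between an existing minority row and one of its nearest minority neighbours, rather than duplicated. The function name `smote_like_oversample` and the toy data are hypothetical and are not taken from your notebook. In practice you would more likely use `SMOTE` from the imbalanced-learn package (or `SMOTENC`, since most of the demographic features are categorical and interpolating label-encoded categories is not meaningful). Whichever technique you choose, resample only the training split.

```python
import numpy as np


def smote_like_oversample(X_min, n_new, k=5, seed=0):
    """Create n_new synthetic minority rows by interpolating between a
    sampled minority row and one of its k nearest minority neighbours.
    Illustrative sketch of the SMOTE idea, not the library implementation."""
    rng = np.random.default_rng(seed)
    X_min = np.asarray(X_min, dtype=float)
    n = len(X_min)
    k = min(k, n - 1)  # cannot have more neighbours than other rows

    # Pairwise Euclidean distances within the minority class
    dist = np.linalg.norm(X_min[:, None, :] - X_min[None, :, :], axis=2)
    np.fill_diagonal(dist, np.inf)  # a row is not its own neighbour
    neighbours = np.argsort(dist, axis=1)[:, :k]

    # Pick a base row and one of its neighbours for every synthetic row
    base = rng.integers(0, n, size=n_new)
    chosen = neighbours[base, rng.integers(0, k, size=n_new)]

    # Place the synthetic row at a random point on the connecting segment
    gap = rng.random((n_new, 1))
    return X_min[base] + gap * (X_min[chosen] - X_min[base])


# Toy numeric data (assumed): 90 majority rows, 10 minority rows
rng = np.random.default_rng(42)
X_major = rng.normal(0.0, 1.0, size=(90, 2))
X_minor = rng.normal(3.0, 1.0, size=(10, 2))

synthetic = smote_like_oversample(X_minor, n_new=len(X_major) - len(X_minor))
X_bal = np.vstack([X_major, X_minor, synthetic])
y_bal = np.array([0] * len(X_major) + [1] * (len(X_minor) + len(synthetic)))

print(np.bincount(y_bal))  # class counts after oversampling
```

A comparison like this, alongside random oversampling and downsampling, would give the report a reasoned basis for the chosen technique rather than a single performance number.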