Marijkevandesteene / MachineLearning

repo to share progress and to manage versions of exam MachineLearning (M14)
0 stars 2 forks source link

Applying SMOTE results in negative recall for random forest classifier #45

Open Marijkevandesteene opened 3 days ago

Marijkevandesteene commented 3 days ago

In consolidating the notebook and applying an oversampling technique to the training data this resulted in a negative recall for damage.

Marijkevandesteene commented 3 days ago

While SMOTE can help improve model performance in terms of accuracy, it can sometimes lead to a decrease in recall for the minority class.

This negative impact on recall can occur because SMOTE generates synthetic examples by interpolating between existing minority class instances. As a result, the model may become overly confident in predicting the minority class, leading to more false positives in the predictions and thus reducing the recall for that class.

To mitigate this issue, it's important to carefully evaluate the trade-offs between different performance metrics like recall, precision, and accuracy when using SMOTE. Additionally, tuning the model parameters and considering alternative techniques like adjusting class weights or using more advanced sampling methods may help improve recall without sacrificing other performance metrics.

dluts commented 10 hours ago

Can be marked as done