nipunbatra / ml-teaching

https://nipunbatra.github.io/ml-teaching/
5 stars 16 forks source link

Added Imbalanced Dataset experiments' first draft. #32

Closed RahulVC02 closed 7 months ago

RahulVC02 commented 7 months ago

This PR adds a notebook called "imbalanced-dataset.ipynb". It has the following two experiments-

  1. Classification (90:10 class ratio in the dataset) - Compares the Accuracy and F1 Score for sklearn's RandomForestClassifier and DummyClassifier (in 'stratified' mode). The accuracies are comparable but the F1 score is very poor for the DummyClassifier as compared to RandomForestClassifier.

  2. Regression (Linear Data + Normal noise) - Compares the mean squared error between sklearn's RandomForestRegressor and DummyRegressor (in 'mean' mode). The MSE values are comparable despite having vastly different predictions on a per-sample level.

review-notebook-app[bot] commented 7 months ago

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

nipunbatra commented 7 months ago

Merged. Thanks @RahulVC02