dsgt-birdclef / birdclef-2022

Code for the BirdCLEF 2022 competition by the DS@GT team
2 stars 3 forks source link

Implement undersampling of classes in classify-nn model #40

Closed acmiyaguchi closed 2 years ago

acmiyaguchi commented 2 years ago

Classes are very imbalanced, leading to predictions that are skewed in favor of the majority class. One way to resolve this is to implement a very naive undersampling routine.

https://github.com/acmiyaguchi/birdclef-2022/blob/main/notebooks/2022-05-18-acm-training-data-per-species.ipynb https://github.com/acmiyaguchi/birdclef-2022/blob/main/notebooks/2022-05-21-acm-classifier-prediction-dist.ipynb

acmiyaguchi commented 2 years ago

https://machinelearningmastery.com/random-oversampling-and-undersampling-for-imbalanced-classification/