Open exalate-issue-sync[bot] opened 1 year ago
Erin LeDell commented: Hey [~accountid:5e43370f5a495e0c91a74ebe] this is something we could try out and benchmark on some imbalanced datasets… however, if you wanted to explore more sophisticated ideas for handling class imbalance, we could expand the scope of the ticket. I thought having this simple rule would be a low-tech “solution” to start with since we currently don’t do anything to address class imbalance in AutoML.
JIRA Issue Migration Info
Jira Issue: PUBDEV-4744 Assignee: Tomas Fryda Reporter: Erin LeDell State: Open Fix Version: N/A Attachments: N/A Development PRs: N/A
If there is more than a 10:1 imbalance in the response column, let's turn on balance_classes = TRUE for all the models in AutoML. We should also consider exposing the balance_classes arg (set to "AUTO" by default) and the other related arguments, class_sampling_factors = NULL, max_after_balance_size = 5.