Closed pplonski closed 3 years ago
Should auto ml do upsampling w/o the user concern? Anyway, the model isn't going to learn anything for that class. Isn't it better to inform the user through a warning that stratification is not possible? @pplonski
I've added a function _handle_drastic_imbalance()
that assures that there is always at least 20 (or k_folds
if set) samples per class.
In the case of too few samples to perform stratification there is an error thrown:
Maybe we can detect such situations and upsample minor classes? For sure it is related to https://github.com/mljar/mljar-supervised/issues/157 However, this issue requires rather a quick fix and #157 requires a larger treatment of unbalanced datasets.