dotnet / machinelearning

ML.NET is an open source and cross-platform machine learning framework for .NET.
https://dot.net/ml
MIT License
9.05k stars, 1.89k forks

Weighting of data, oversampling scale, or some other way to deal with the unbalanced data category problem #6523

Open DrewZero opened 2 years ago

DrewZero commented 2 years ago

Is your feature request related to a problem? Please describe. A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]

Describe the solution you'd like A clear and concise description of what you want to happen.

Describe alternatives you've considered A clear and concise description of any alternative solutions or features you've considered.

Additional context Add any other context or screenshots about the feature request here.

LittleLittleCloud commented 2 years ago

@luisquintanilla maybe we can add an option to resample the dataset in the advanced training options in order to deal with imbalanced data?

Currently, ML.NET has no transformer for resampling imbalanced data, but in the long run it's definitely worthwhile to implement one.
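For illustration only, here is a minimal sketch of what random oversampling does. It is written in plain Python, not against the ML.NET API, and the names `oversample` and `label_of` are hypothetical. It assumes the simplest strategy: duplicate rows from the smaller classes, sampled with replacement, until every class is as large as the biggest one.

```python
import random
from collections import Counter, defaultdict


def oversample(rows, label_of, seed=0):
    """Return a shuffled copy of rows in which every class has as many
    rows as the largest class, by duplicating minority rows at random.

    `oversample` and `label_of` are illustrative names, not ML.NET APIs.
    """
    if not rows:
        return []

    rng = random.Random(seed)

    # Group rows by their class label.
    by_label = defaultdict(list)
    for row in rows:
        by_label[label_of(row)].append(row)

    # Every class is grown to the size of the largest class.
    target = max(len(group) for group in by_label.values())

    balanced = []
    for group in by_label.values():
        balanced.extend(group)
        # Sample with replacement to make up the shortfall (k may be 0).
        balanced.extend(rng.choices(group, k=target - len(group)))

    rng.shuffle(balanced)
    return balanced


# Example: 8 rows of class 0 and 2 rows of class 1.
rows = [("majority", 0)] * 8 + [("minority", 1)] * 2
balanced = oversample(rows, label_of=lambda row: row[1])
counts = Counter(label for _, label in balanced)
print(counts[0], counts[1])  # prints "8 8"
```

Resampling like this is normally applied to the training split only, so that evaluation data keeps the original class distribution.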

beccamc commented 1 year ago

@luisquintanilla Please take a look when you have a chance.

luisquintanilla commented 1 year ago

Thanks for looking into this @LittleLittleCloud. We can't implement anything in Model Builder until it's in the framework. Moving it to that repo for now.