Defining sampling strategy

nickkunz / smogn

Synthetic Minority Over-Sampling Technique for Regression

GNU General Public License v3.0

312 stars 78 forks source link

Is it possible to use the algorithm to apply upsampling without any downsampling. For example, if I have a dataset with the following distribution of the target feature: 500 Negative Samples 200 Positive Samples 1000 ==0 Samples

Can I set the algorithm to only upsample the number of positive values without affecting the number of negative and equal to zero samples. For example, the output will be

500 Negative Samples 500 Positive Samples 1000 ==0 Samples

I know that in the imblearn.over_sampling.SMOTENC function it is possible to set the 'sampling_strategy' argument to a dictionary where the keys correspond to the targeted classes. The values correspond to the desired number of samples for each targeted class.

nickkunz / smogn

Defining sampling strategy #5