JuliaML / MLDataUtils.jl

Utility package for generating, loading, splitting, and processing Machine Learning datasets
http://mldatautilsjl.readthedocs.io/
Other
102 stars 20 forks source link

CV with balanced sampling #64

Open yuehhua opened 3 years ago

yuehhua commented 3 years ago

To avoid training on imbalanced datasets, we usually sample data to a balance manner and ensure that every fold in cv contains all kinds of labels. I'd like to find API like this, but I find stratified sampling and cross validation independently. Do we have methods to combine them together? Or how to use them collectively?