mbernico / snape

Snape is a convenient artificial dataset generator that wraps sklearn's make_classification and make_regression and then adds in 'realism' features such as complex formating, varying scales, categorical variables, and missing values.
Apache License 2.0
165 stars 21 forks source link

add shuffle option #14

Closed SuryaThiru closed 4 years ago

SuryaThiru commented 4 years ago

sklearn's make_classification and make_regression function by default shuffles the features and samples. For cases where we might want to keep track of the type of features (informative or redundant), this might be undesirable. The commit adds the shuffle option to the configuration file.

mbernico commented 4 years ago

Hi @SuryaThiru Thanks so much for putting this together. We let snape get a little old unfortunately. I just merged a big PR from @tgsmith61591 that fixed Travis amongst other things. Can you please rebase and we will get this merged? Sorry for the delay as well, real life got in my way for a bit. :)

SuryaThiru commented 4 years ago

No problem. Will look into that

SuryaThiru commented 4 years ago

Let me know if anything else needs to be added