mljar / mljar-supervised

Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
https://mljar.com
MIT License
3.05k stars 408 forks source link

Add option to not shuffle rows #6

Closed tmontana closed 4 years ago

tmontana commented 5 years ago

When dealing with time series data it is important to not shuffle rows when performing cross-validation. Would be good to have that as an option.

pplonski commented 5 years ago

Right now, as default the 5-fold cross-validation is used for assesing model's performance. To change this behavior we need to add validation variable to AutoML interface.

pplonski commented 4 years ago

This can be done by setting the _validation variable after AutoML object is constructed. Example:

automl = AutoML()
automl._validation = {"validation_type": "kfold", "k_folds": 5, "shuffle": False}
automl.fit(X, y)