Closed lshpaner closed 2 months ago
Line 1207, function get_cross_validate(): the variable stratify is redundant.
We need to make a note in the documentation that stratify_cols cannot be used when using cross_validation
Checked the code changes with the debugger. Works!
done
Description:
Currently, the train_val_test_split method allows stratification either by y (stratify_y) or by specified columns (stratify_cols), but not both at the same time. There are use cases where stratification by both the target variable (y) and specific columns is necessary to ensure a balanced and representative split across different data segments.
Proposed Enhancement:
Modify the method to support simultaneous stratification by both y and stratify_cols. This can be achieved by combining the stratification keys or implementing logic that ensures both y and the specified columns are considered during the stratification process.
Current Method Implementation:
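As a point of comparison, the "combining the stratification keys" idea from the proposal could look roughly like the sketch below. This is not the current implementation. The function name combined_stratify_split and its parameters validation_size and test_size are hypothetical, and the use of scikit-learn's train_test_split is an assumption about how the split is performed.

```python
import pandas as pd
from sklearn.model_selection import train_test_split


def combined_stratify_split(X, y, stratify_cols,
                            validation_size=0.2, test_size=0.2,
                            random_state=42):
    """Hypothetical helper: train/val/test split stratified jointly
    on the target y and on the columns listed in stratify_cols."""
    # Build one string key per row from y and each stratification column.
    # X and y are assumed to share the same index.
    key = y.astype(str)
    for col in stratify_cols:
        key = key + "_" + X[col].astype(str)

    # First split: carve out validation + test together, stratified on the key.
    # Every key value needs enough rows to appear in each split,
    # otherwise train_test_split raises a ValueError.
    holdout = validation_size + test_size
    X_train, X_hold, y_train, y_hold, _, key_hold = train_test_split(
        X, y, key, test_size=holdout, stratify=key,
        random_state=random_state,
    )

    # Second split: divide the holdout into validation and test,
    # again stratified on the same combined key.
    X_val, X_test, y_val, y_test = train_test_split(
        X_hold, y_hold, test_size=test_size / holdout,
        stratify=key_hold, random_state=random_state,
    )
    return X_train, X_val, X_test, y_train, y_val, y_test
```

Joining the values into a single key keeps the call to train_test_split unchanged, at the cost of creating many small strata when the columns have high cardinality. Rare combinations may need to be merged or dropped before splitting.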