squaredev-io / whitebox

[Not Actively Maintained] Whitebox is an open source E2E ML monitoring platform with edge capabilities that plays nicely with kubernetes
https://squaredev.io/whitebox/
MIT License
183 stars 5 forks source link

Adjust the train_test_split parts in pipelines #91

Open stavrostheocharis opened 1 year ago

stavrostheocharis commented 1 year ago

We use "train_test_split" in "src/analytics/models/pipelines" at lines 40 & 98. In case that after the splitting only one class remains at the train set, an error will follow based on the next calculations (AUC ROC, etc.).

Need to adjust the train_test_split parts and possibly use the "stratify" argument as described inside the docs: https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.train_test_split.html

aditkay95 commented 1 year ago

I am willing to work on this as my first issue @stavrostheocharis @momegas

momegas commented 1 year ago

Feel free to open a PR. I assign this to you as well.

momegas commented 1 year ago

Hey @aditkay95 please tell me when to move to in progress when you start working on it

Mikehade commented 1 year ago

I would like to work on this issue @momegas and @stavrostheocharis

momegas commented 1 year ago

Im assining this to you