Team-fastML / fastML

A Python package built on sklearn for running a series of classification Algorithms in a faster and easier way.
https://pypi.org/project/fastML/
MIT License
50 stars 23 forks

Add Methods to impute missing data points. #13

Closed (buabaj closed 4 years ago)

buabaj commented 4 years ago

Is your feature request related to a problem? Please describe.

We need to add methods for handling missing data. At the moment the function only works with clean, processed data. One way to do this is to drop rows that have missing data. Any other suggestions are welcome.

willakuffo commented 4 years ago

I guess we could also replace each missing data point with the mode, median, or even the mean in order to maintain the sample size.
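To make the suggestion concrete, here is a minimal sketch of mean, median, and mode imputation for a single column. It uses only the standard library so it runs on its own; `impute_column` and the sample `ages` list are hypothetical names for illustration and are not part of fastML. In an sklearn-based workflow, `sklearn.impute.SimpleImputer` offers comparable strategies (`"mean"`, `"median"`, `"most_frequent"`).

```python
import math
import statistics


def impute_column(values, strategy="mean"):
    """Replace missing entries (None or NaN) in one column with a summary statistic.

    Hypothetical helper for illustration only; not part of fastML.
    """
    def is_missing(v):
        return v is None or (isinstance(v, float) and math.isnan(v))

    # Compute the statistic from the observed (non-missing) values only.
    observed = [v for v in values if not is_missing(v)]
    if not observed:
        raise ValueError("cannot impute a column with no observed values")

    if strategy == "mean":
        fill = statistics.mean(observed)
    elif strategy == "median":
        fill = statistics.median(observed)
    elif strategy == "mode":
        fill = statistics.mode(observed)
    else:
        raise ValueError(f"unknown strategy: {strategy!r}")

    # Keep every row; only the missing cells are replaced.
    return [fill if is_missing(v) else v for v in values]


# Made-up sample column with two missing entries.
ages = [22, None, 35, 22, float("nan"), 41]
imputed_mean = impute_column(ages, "mean")
imputed_median = impute_column(ages, "median")
imputed_mode = impute_column(ages, "mode")
```

All three variants return a list of the same length as the input, which is the point of imputing rather than dropping: no samples are lost.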

buabaj commented 4 years ago

Sounds better than dropping the whole row.

buabaj commented 4 years ago

After a discussion, we agreed to assume that users supply clean data; otherwise we drop rows with missing values, which I have successfully implemented.
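For reference, dropping rows with missing values could look roughly like the sketch below. This is not the actual fastML implementation, which is not shown in this thread; `drop_incomplete_rows`, `X`, and `y` are hypothetical names, and the sketch assumes features are given as a list of rows with a parallel list of labels. With a pandas DataFrame, `DataFrame.dropna()` does the equivalent job.

```python
import math


def is_missing(v):
    """Treat None and float NaN as missing."""
    return v is None or (isinstance(v, float) and math.isnan(v))


def drop_incomplete_rows(X, y):
    """Drop every sample whose features or label contain a missing value.

    Hypothetical helper for illustration only; not the fastML code.
    Features and labels are filtered together so they stay aligned.
    """
    kept = [
        (row, label)
        for row, label in zip(X, y)
        if not is_missing(label) and not any(is_missing(v) for v in row)
    ]
    X_clean = [row for row, _ in kept]
    y_clean = [label for _, label in kept]
    return X_clean, y_clean


# Made-up sample data: the second row has a missing feature,
# and the fourth row has a missing label.
X = [[5.1, 3.5], [4.9, None], [6.2, 2.9], [5.9, 3.0]]
y = [0, 0, 1, None]
X_clean, y_clean = drop_incomplete_rows(X, y)
```

The trade-off raised earlier in the thread still applies: this keeps the data honest but shrinks the sample, so it works best when only a small share of rows is incomplete.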