Open pedromlsreis opened 4 years ago
Getting a ValueError: Input contains NaN, infinity or a value too large for dtype('float32').
Fixed in a120ae892360aed31cf0e15714fcad6be35095d9.
The commit 01f15d8ce26ff6d8a472858e32959a71edc222ec adds a Random Forest classifier to predict the missing values (NaNs) in the categorical columns.
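For readers following along, here is a minimal sketch of the general idea, not the code from that commit. It trains a Random Forest on the rows where a categorical column is known and predicts the rows where it is NaN. The helper name `impute_categorical_with_rf` and the column names `x1`, `x2`, `cat` are hypothetical, and the sketch assumes the feature columns are numeric and free of NaNs.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier


def impute_categorical_with_rf(df, target_col, feature_cols, random_state=0):
    """Fill NaNs in `target_col` with Random Forest predictions.

    Hypothetical helper: assumes `feature_cols` are numeric and have no NaNs.
    """
    out = df.copy()
    missing = out[target_col].isna()
    if not missing.any() or missing.all():
        return out  # nothing to fill, or no labelled rows to train on

    clf = RandomForestClassifier(n_estimators=100, random_state=random_state)
    # Train only on rows where the categorical value is known.
    clf.fit(out.loc[~missing, feature_cols], out.loc[~missing, target_col])
    # Predict the category for the rows where it is missing.
    out.loc[missing, target_col] = clf.predict(out.loc[missing, feature_cols])
    return out


# Toy data with hypothetical column names, not the project's schema.
toy = pd.DataFrame({
    "x1": [0.0, 0.1, 0.2, 5.0, 5.1, 5.2, 0.05, 5.05],
    "x2": [1.0, 1.1, 0.9, 9.0, 9.1, 8.9, 1.0, 9.0],
    "cat": ["a", "a", "a", "b", "b", "b", np.nan, np.nan],
})
filled = impute_categorical_with_rf(toy, "cat", ["x1", "x2"])
```

The original frame is left untouched, so the imputed column can be compared against the raw one before it is fed to any downstream estimator.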
We should look into the classifier's hyperparameters and investigate this further. We can try splitting the dataframe into train/validation sets and using cross-validation to tune the RF classifier's hyperparameters.
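One possible way to do that, sketched with scikit-learn's `train_test_split` and `GridSearchCV`. The data here is synthetic (`make_classification`) as a stand-in for the rows whose categorical value is known, and the parameter grid is only an illustrative starting point, not a recommendation for this project.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, train_test_split

# Synthetic stand-in for the rows where the categorical value is known.
X, y = make_classification(
    n_samples=300, n_features=8, n_informative=4, random_state=0
)

# Hold out a validation set that the hyperparameter search never sees.
X_train, X_valid, y_train, y_valid = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# Illustrative grid only; the values are assumptions, not tuned choices.
param_grid = {
    "n_estimators": [50, 100],
    "max_depth": [None, 5],
    "min_samples_leaf": [1, 3],
}

# 3-fold cross-validation on the training split for each combination.
search = GridSearchCV(
    RandomForestClassifier(random_state=0),
    param_grid,
    cv=3,
    scoring="accuracy",
)
search.fit(X_train, y_train)

best_params = search.best_params_
# Score the refitted best model on the held-out validation rows.
valid_accuracy = search.score(X_valid, y_valid)
print(best_params, valid_accuracy)
```

Keeping the validation split outside the search gives a less optimistic estimate than the cross-validation score alone, since the best parameters were selected on the training folds.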