Request to add functionality for training / test inputs that contain NaN

jasperroebroek / sklearn-quantile

BSD 3-Clause "New" or "Revised" License

The current version of scikit-learn's RandomForestRegressor supports NaNs (missing values) in training data, but sklearn-quantile does not.

If you edit the lines of code (e.g., in quantile.pyx) that call "check_X_y" or "check_array" from sklearn so that they pass force_all_finite="allow-nan", and then recompile, everything works fine as far as I can tell.
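To illustrate the validation change, here is a minimal sketch on a toy array rather than the actual code in quantile.pyx. The helper name check_array_allow_nan is hypothetical. The keyword lookup is my own addition: as far as I know, recent scikit-learn releases renamed force_all_finite to ensure_all_finite, so the sketch picks whichever name the installed version accepts.

```python
import inspect

import numpy as np
from sklearn.utils import check_array

# Pick whichever keyword this scikit-learn version accepts
# (assumption: newer releases use "ensure_all_finite").
_FINITE_KW = (
    "ensure_all_finite"
    if "ensure_all_finite" in inspect.signature(check_array).parameters
    else "force_all_finite"
)


def check_array_allow_nan(X):
    """Hypothetical helper: validate X but let NaN through."""
    return check_array(X, **{_FINITE_KW: "allow-nan"})


# Toy input with one missing value.
X = np.array([[1.0, np.nan], [3.0, 4.0]])

# Default validation rejects NaN with a ValueError.
try:
    check_array(X)
    default_rejects_nan = False
except ValueError:
    default_rejects_nan = True

# With "allow-nan" the array passes validation and the NaN is preserved.
X_checked = check_array_allow_nan(X)
```

The same keyword can be passed to check_X_y. Infinite values are still rejected under "allow-nan".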

However, you also need to replace "np.quantile" with "np.nanquantile"; otherwise some of the output quantiles are NaN.
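A small sketch of the difference between the two NumPy functions, using made-up values (not data from the library): any NaN in the input makes np.quantile return NaN, while np.nanquantile ignores the NaN entries.

```python
import numpy as np

# Made-up values for illustration, with one missing entry.
values = np.array([1.0, 2.0, np.nan, 4.0])
q = [0.25, 0.5, 0.75]

# NaN in the input propagates to the requested quantiles.
plain = np.quantile(values, q)

# NaN entries are dropped; quantiles are computed from [1.0, 2.0, 4.0].
nan_aware = np.nanquantile(values, q)

print(plain)      # all NaN
print(nan_aware)  # 1.5, 2.0, 3.0
```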

I have not done extensive testing, but in the use cases with my own data, this appears to work. Maybe someone else will benefit from this too.

Request to add functionality for training / test inputs that contain NaN #11