Closed dpellow closed 1 year ago
Note - this is being enforced by an assertion in sklearn's _validate_data
. If the model actually is meant to allow for NaNs you could set force_all_finite="allow-nan"
in the _validate_data
function (line 89 of forest.py)
Theoretically random forest could handle missing values with surrogate splits, however this isn't implemented in sksurv, and AFAIK in sklearn neither. Currently, sksurv includes no model that can deal with missing values implicitly.
I thought that Random forests should be able to handle missing data, but when I train an RSF model with some feature values missing it produces the error
Does the RSF implementation not handle missing values in the input features? Do any of the other models support them?
Version info: