Closed vruusmann closed 3 years ago
The above happens with pandas.Series
boolean columns that contain missing values - in that case there are three category levels in action - False
, True
and float("NaN")
denoting missing values.
The converter could detect such "one extra category"-situations and see if there's a float("NaN")
category level involved. If it is, then it should be "demoted" to a normal missing value.
Encountered the following exception, when training a binary classifier with a sparse dataset that contains a boolean column ("Audit/Deductions"):
The boolean value space contains exactly two scalar values. It's weird that the
values.size() == 1
, didn't fire inPredicateManager#createSimpleSetPredicate(...)
, which suggests that thevalues
variable is either an empty collection, or a two-valued one.