jpmml / jpmml-lightgbm

Java library and command-line application for converting LightGBM models to PMML
GNU Affero General Public License v3.0

Command-line option for promoting `NaN` values from invalid to missing status #34

Closed vruusmann closed 4 years ago

vruusmann commented 4 years ago

See https://github.com/jpmml/jpmml-lightgbm/issues/33

Both LightGBM and XGBoost treat NaN as missing values. This is in conflict with (J)PMML conventions, which treat NaN as invalid values.

The solution would be to generate MiningField@invalidValueTreatment="asMissing" attributes, which would cause the model to promote all invalid values (including NaN) to missing values.
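To illustrate what the generated markup would look like, here is a minimal sketch that post-processes a PMML document with Python's standard `xml.etree.ElementTree` module and sets the attribute on every active field. This is not how the converter implements the option. The PMML fragment, the field names, and the helper `promote_invalid_to_missing` are hypothetical, and the PMML 4.4 namespace is an assumption about the schema version of the file.

```python
import xml.etree.ElementTree as ET

# Assumption: the document uses the PMML 4.4 namespace.
PMML_NS = "http://www.dmg.org/PMML-4_4"

# Hypothetical, heavily trimmed PMML fragment, for illustration only.
PMML_TEXT = """<PMML xmlns="http://www.dmg.org/PMML-4_4" version="4.4">
  <MiningModel functionName="regression">
    <MiningSchema>
      <MiningField name="y" usageType="target"/>
      <MiningField name="x1"/>
      <MiningField name="x2"/>
    </MiningSchema>
  </MiningModel>
</PMML>"""


def promote_invalid_to_missing(root):
    """Set invalidValueTreatment="asMissing" on every active MiningField.

    Returns the names of the fields that were modified.
    """
    changed = []
    for field in root.iter("{%s}MiningField" % PMML_NS):
        # In PMML, usageType defaults to "active" when the attribute is absent.
        if field.get("usageType", "active") != "active":
            continue
        field.set("invalidValueTreatment", "asMissing")
        changed.append(field.get("name"))
    return changed


root = ET.fromstring(PMML_TEXT)
changed = promote_invalid_to_missing(root)
print(changed)
```

With the attribute in place, a (J)PMML evaluator that classifies NaN as an invalid value would hand it to the model as a missing value, which matches how LightGBM treats NaN at training time.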