dmlc / xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
Apache License 2.0

BUG: SHAP values calculated using CPU differ from SHAP values calculated using GPU for XGBOOST #10290

Open madakkmi opened 2 weeks ago

madakkmi commented 2 weeks ago

There are large differences between SHAP values calculated on CPU and on GPU for XGBoost models with feature_perturbation='interventional' and model_output='log_loss'. A detailed description of the bug, including how to reproduce it and the traceback, is provided in the linked issue. As mentioned there, the problem seems to originate in the SHAP implementation in the xgboost repository. It would be great if this could be fixed.
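To put a number on "large differences", one option is to report the largest element-wise gap between the two result matrices. The following is only a minimal sketch: the helper names `max_abs_diff` and `shap_values_match`, the tolerance, and the sample numbers are all hypothetical placeholders, not values from this report. It assumes the CPU and GPU SHAP values have already been computed and converted to nested lists of the same shape (rows are samples, columns are features).

```python
from typing import Sequence


def max_abs_diff(cpu_vals: Sequence[Sequence[float]],
                 gpu_vals: Sequence[Sequence[float]]) -> float:
    """Return the largest absolute element-wise gap between two SHAP matrices."""
    if len(cpu_vals) != len(gpu_vals):
        raise ValueError("row counts differ")
    worst = 0.0
    for row_cpu, row_gpu in zip(cpu_vals, gpu_vals):
        if len(row_cpu) != len(row_gpu):
            raise ValueError("column counts differ")
        for a, b in zip(row_cpu, row_gpu):
            worst = max(worst, abs(a - b))
    return worst


def shap_values_match(cpu_vals: Sequence[Sequence[float]],
                      gpu_vals: Sequence[Sequence[float]],
                      atol: float = 1e-4) -> bool:
    """True if every pair of values agrees within the (assumed) tolerance."""
    return max_abs_diff(cpu_vals, gpu_vals) <= atol


# Placeholder numbers for illustration only; NOT taken from the bug report.
example_cpu = [[0.5, -0.25], [0.125, 0.0]]
example_gpu = [[0.5, -0.25], [0.125, 0.5]]
print(max_abs_diff(example_cpu, example_gpu))
print(shap_values_match(example_cpu, example_gpu))
```

Posting the maximum gap together with the tolerance used would make it easier to tell an ordinary floating-point discrepancy from an actual algorithmic mismatch.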

trivialfis commented 2 days ago

cc @RAMitchell .