Calculation of SHAP values for continuous and categorical variables

ModelOriented / survex

Explainable Machine Learning in Survival Analysis

GNU General Public License v3.0

94 stars 10 forks source link

Hi @lzxcvn, where did you find the categorical_variables parameter?

In most cases, SHAP does not distinguish between continuous and categorical variables. It might be important when conditional imputation is used for feature marginalization (instead of the default marginal feature distribution). For details, refer to the shapr R package https://github.com/NorskRegnesentral/shapr, and the related research e.g. https://doi.org/10.1007/s10618-024-01016-z.

Moreover, KernelSHAP is an approximation algorithm that includes randomness, which can lead to changes in the order of importance of the variables.

ModelOriented / survex

Calculation of SHAP values for continuous and categorical variables #93