pytorch / captum

Model interpretability and understanding for PyTorch
https://captum.ai
BSD 3-Clause "New" or "Revised" License
4.71k stars 475 forks

baselines in shap #1125

Open Luckick opened 1 year ago

Luckick commented 1 year ago

Hi,

From my understanding, when SHAP synthesizes the dataset, part of the features must be replaced according to the distribution of those features in a background dataset (the training set, or samples from it). That is why the shap package requires you to provide a background dataset for the explainer.

However, it seems Captum does not need a background dataset (the baseline can be a single instance, or it has to have the same size as the test set). How does Captum synthesize the dataset for permutation?

Thanks!
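(For context, the SHAP-style synthesis described above can be sketched in a few lines of NumPy. This is a conceptual illustration only, not shap's actual implementation; `synthesize` and all names here are made up for the example.)

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy background dataset: 100 samples, 4 features
# (in practice: the training set or samples from it).
background = rng.normal(size=(100, 4))

def synthesize(x, keep_mask, background, rng):
    """SHAP-style synthesis: features where keep_mask is False are
    replaced with values from a randomly drawn background sample."""
    ref = background[rng.integers(len(background))]
    return np.where(keep_mask, x, ref)

x = np.array([1.0, 2.0, 3.0, 4.0])
keep = np.array([True, False, True, False])
synthetic = synthesize(x, keep, background, rng)
# Kept features stay as in x; masked ones come from the background sample.
```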

aobo-y commented 10 months ago

@Luckick Sorry for the late reply! This is a great question.

But unfortunately, this is a known limitation of Captum right now. Captum assumes you provide a single baseline whose values are used to replace part of the features. We are aware this is not ideal for two use cases:
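(To illustrate the single-baseline behavior described above: masked features are always filled from one fixed baseline, regardless of any data distribution. A minimal NumPy sketch, not Captum's actual code; the function name is made up.)

```python
import numpy as np

def perturb_with_baseline(x, keep_mask, baseline):
    """Single-baseline replacement: every masked ('missing') feature
    always takes the corresponding value from the one baseline."""
    return np.where(keep_mask, x, baseline)

x = np.array([1.0, 2.0, 3.0])
baseline = np.zeros(3)          # e.g. an all-zeros baseline instance
out = perturb_with_baseline(x, np.array([True, False, True]), baseline)
# out is [1.0, 0.0, 3.0]: feature 1 was replaced by the baseline value.
```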

1st, Shapley may need to sample multiple baseline instances to approximate the distribution of the "missing" features, and the function value should be the expectation over all of them.

2nd, the features may not be independent, so the baselines for the "missing" features depend on the remaining unchanged features.
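(The first point above, the expectation over multiple sampled baselines, can be sketched like this. A toy NumPy example with an invented model `f` and helper name; not Captum code.)

```python
import numpy as np

rng = np.random.default_rng(0)
background = rng.normal(size=(50, 3))   # toy background dataset

def f(x):
    # Toy "model": sum of the features.
    return float(np.sum(x))

def masked_expectation(x, keep_mask, background):
    """Substitute every background row into the masked positions and
    average the model outputs, approximating E[f] over the
    distribution of the 'missing' features."""
    synthetic = np.where(keep_mask, x, background)  # one row per sample
    return float(np.mean([f(row) for row in synthetic]))

x = np.array([1.0, 2.0, 3.0])
keep = np.array([True, False, True])    # feature 1 is "missing"
val = masked_expectation(x, keep, background)
# val ≈ x[0] + x[2] + mean of background feature 1
```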

A single baseline cannot support these 2 requirements. We are discussing supporting passing the baseline argument as a function, (input, perturbation_mask) -> baselines, so users can customize how the baselines are sampled. cc @vivekmig @NarineK
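(To make the proposal concrete: a user-supplied callable with the (input, perturbation_mask) -> baselines shape could sample from a background dataset, or condition on the unperturbed features. This is purely hypothetical, the signature is only under discussion and nothing below is an existing Captum API.)

```python
import numpy as np

rng = np.random.default_rng(0)
background = rng.normal(size=(100, 4))  # assumed user-provided background data

def baseline_fn(inputs, perturbation_mask):
    """Hypothetical callable matching the proposed
    (input, perturbation_mask) -> baselines signature.
    Here it just draws one random background row; a smarter version
    could condition on the unperturbed features in `inputs`."""
    return background[rng.integers(len(background))]

x = np.array([1.0, 2.0, 3.0, 4.0])
mask = np.array([True, True, False, False])  # False = feature to replace
baselines = baseline_fn(x, mask)
perturbed = np.where(mask, x, baselines)     # what the attributor would do
```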