Questions about the results obtained by XAI method

Trusted-AI / AIX360

Interpretability and explainability of data and machine learning models

Apache License 2.0

1.63k stars 307 forks source link

This is a common phenomenon when considering perturbation based XAI methods. In order to estimate the marginal contribution of your input features (pixels in the case of saliency maps), XAI algorithms often perform some sort of sampling in the input space, which can be non-deterministic (especially if a random sampling is done). This is also why explanations obtained using this kind of method needs to be manipulated carefully. What they are really doing is creating a local simple surrogate (linear classifier, small decision tree) to explain one sample, and in order to make sense, this makes the assumption that the decision boundary will be approximated by a simple model.

Trusted-AI / AIX360

Questions about the results obtained by XAI method #167