viadee / javaAnchorExplainer

Explains machine learning models fast using the Anchor algorithm originally proposed by marcotcr in 2018
BSD 3-Clause "New" or "Revised" License
15 stars 3 forks source link

Contrastive Explanations #14

Open TobiasGoerke opened 4 years ago

TobiasGoerke commented 4 years ago

Anchors is able to explain any model's decision (e.g. for a label it predicted). However, the explained label does not necessarily have to equal the value the model did actually predict but can be freely chosen.

So, we can force the model to explain a decision it has not made. This would reveil its motivation to classify an instance differently - even though it didn't.

I'd like to start a discussion about how this information could be used.

Surely, visualization is one use-case. Showing some sort of matrix for an explanations that displays which features voted for and which voted against the decision would be possible and helpful. Any more ideas?

fkoehne commented 4 years ago

A new aspect on the upcoming master thesis? Basically I would want both approachs with an integrated conclusion.