viadee / javaAnchorExplainer

Explains machine learning models fast using the Anchor algorithm originally proposed by marcotcr in 2018
BSD 3-Clause "New" or "Revised" License
15 stars 3 forks source link

Non distinguishable anchors #17

Closed TobiasGoerke closed 4 years ago

TobiasGoerke commented 4 years ago

While using the implementation I stumbled upon results which were nothing like I expected and additionally very misleading: I've been dealing with a very unbalanced dataset and a model predicting the same label in the absolute majority of cases. This caused the algorithm to evaluate all candidates in the first round to have a precision of 1. Thereafter, it chose the rule with the highest coverage. The result was thus comprising one feature that was not essential to the model's prediction. While this is expected behavior (and could be approached by configuring the MAB) it is very counter intuitive for novices and we should do something about it.

I propose the following: