david-cortes / isotree

(Python, R, C/C++) Isolation Forest and variations such as SCiForest and EIF, with some additions (outlier detection + similarity + NA imputation)
https://isotree.readthedocs.io
BSD 2-Clause "Simplified" License

Question on the anomaly score result: which column (attribute) contributes the most? #35

Shea1986 closed this issue 2 years ago

Shea1986 commented 2 years ago

Great work on isotree. May I ask a question? In high-dimensional data, is there a way to tell which column or columns contribute the most to a high anomaly score? It would be tremendously helpful in medical fields.

david-cortes commented 2 years ago

There's no such functionality in this package, but you can use black-box prediction explainers from other libraries, e.g. ones based on SHAP or local sensitivity.
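To make the local-sensitivity idea concrete, here is a minimal, model-agnostic sketch. It is not part of isotree and it is not SHAP: it is a simple perturbation approach in which each column of one row is replaced, in turn, by values from a background sample, and the resulting average drop in the anomaly score is recorded. The names `column_sensitivity`, `score_fn`, and `toy_score` are hypothetical, and `toy_score` is only a stand-in for a real model's scoring function.

```python
import numpy as np


def column_sensitivity(score_fn, x, background):
    """Average score drop when each column of `x` is replaced by background values.

    score_fn   : callable taking a 2D array (rows, columns) and returning one
                 anomaly score per row, where higher means more anomalous.
    x          : 1D array, the row to explain.
    background : 2D array of reference rows (e.g. a sample of the training data).
    """
    x = np.asarray(x, dtype=float)
    background = np.asarray(background, dtype=float)
    base = float(score_fn(x[None, :])[0])
    drops = np.empty(x.shape[0])
    for j in range(x.shape[0]):
        # Copy the row once per background row, then swap in column j only.
        perturbed = np.tile(x, (background.shape[0], 1))
        perturbed[:, j] = background[:, j]
        drops[j] = base - float(np.mean(score_fn(perturbed)))
    return drops


# Toy data and a toy scorer (assumption: stands in for a fitted model).
rng = np.random.default_rng(0)
background = rng.normal(size=(200, 3))
center = background.mean(axis=0)
scale = background.std(axis=0)


def toy_score(X):
    # Sum of squared z-scores: grows as a row moves away from the bulk of the data.
    z = (np.asarray(X, dtype=float) - center) / scale
    return (z ** 2).sum(axis=1)


x = np.array([0.1, 8.0, -0.2])  # the second column is the unusual one
drops = column_sensitivity(toy_score, x, background)
top_column = int(np.argmax(drops))  # index of the column with the largest drop
```

With a fitted isotree model, the scoring function would presumably be the model's `predict` method in place of `toy_score`, assuming it returns one score per row with higher values meaning more anomalous. Columns are perturbed one at a time here, so interactions between columns are not captured; SHAP-style explainers are the usual choice when that matters.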