christophM / interpretable-ml-book

Book about interpretable machine learning
https://christophm.github.io/interpretable-ml-book/
Other
4.72k stars 1.04k forks source link

Permutation Feature Importance Disruption Guarantee #320

Open DarioS opened 2 years ago

DarioS commented 2 years ago

Section 8.5 has

... we permuted the feature’s values, which breaks the relationship between the feature and the true outcome.

However, I think that permutation could rarely end up with a similar distribution of per-class measurements as the original data and hide the performance loss. Would a more robust method be to calculate the mean of a feature and set all samples to that value?