ing-bank / probatus

Validation (like Recursive Feature Elimination for SHAP) of (multiclass) classifiers & regressors and data used to develop them.
https://ing-bank.github.io/probatus
MIT License
132 stars 40 forks source link

[WIP] Penalty on shap calculation for higher variance #217

Closed markdregan closed 1 year ago

markdregan commented 1 year ago

PR addresses #216

Overall objective: Add penalty to features that has high variance in underlying shap values - when computing feature importance. This will (in theory) encourage selection of features that have more coherency across CV folds.

Work tasks:

Reviewers: LMK any changes / improvements that can be made.

ReinierKoops commented 1 year ago

Thanks for the PR. I’ll review it once it’s complete. Please have a look at the current GitHub actions run output. Will rerun it once you pushed new changes.

markdregan commented 1 year ago

@ReinierKoops - I seem to have based this PR off some older version of probatus. Just submitted fresh PR. Apologies. I'll delete this PR.