extend code to feature groups

bdwilliamson / vimpy

Perform inference on algorithm-agnostic variable importance in Python

MIT License

20 stars, 5 forks

Which function are you using?

For both vim and cv_vim, you should be able to pass a list of column indices to the argument s. For your example, if your predictors are [blood pressure, heart rate, sodium, potassium, sugar], you could pass s = [0,1] to estimate the importance of the vitals as a group.
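As a minimal sketch of building that index list from feature names rather than hard-coding positions: the helper `group_indices` below is hypothetical (it is not part of vimpy), and the feature order is the one from the example above.

```python
# Feature order as given in the example above (zero-based column positions).
feature_names = ["blood pressure", "heart rate", "sodium", "potassium", "sugar"]


def group_indices(feature_names, group):
    """Return the zero-based column positions of the named features.

    Hypothetical helper for illustration; not part of vimpy.
    """
    position = {name: i for i, name in enumerate(feature_names)}
    return [position[name] for name in group]


# Index lists that could then be passed as the argument s.
vitals = group_indices(feature_names, ["blood pressure", "heart rate"])
labs = group_indices(feature_names, ["sodium", "potassium", "sugar"])
```

Looking the positions up by name keeps the group definition correct if the column order of the predictor matrix changes.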

Groups aren't currently set up in spvim. To extend it to groups, we would (a) partition the features into groups (in your example: vitals, labs, and diagnoses); (b) measure predictiveness for each combination of the feature groups (in your example: all variables, no variables, vitals alone, labs alone, diagnoses alone, vitals + labs, vitals + diagnoses, labs + diagnoses); and (c) combine these using the Shapley formula. The normalization constant would differ from the one for individual-variable Shapley values, since the groups, not the individual variables, play the role of the players.
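Steps (a) to (c) can be sketched roughly as follows. This is an illustration under stated assumptions, not how spvim is implemented: it enumerates every combination of groups exactly, the function `group_shapley` and the `predictiveness` callable are hypothetical names, the column indices for the diagnoses group are made up, and the toy predictiveness measure is a simple additive stand-in for a fitted model's performance.

```python
from itertools import combinations
from math import factorial


def group_shapley(groups, predictiveness):
    """Exact Shapley values with feature groups as the players.

    groups: dict mapping group name -> list of column indices (step a).
    predictiveness: hypothetical callable taking a list of column indices
        and returning a float (e.g. R-squared of a model fit on those columns).
    """
    names = list(groups)
    n_groups = len(names)

    # (b) predictiveness for every combination of groups,
    # including the empty set and the full set.
    value = {}
    for k in range(n_groups + 1):
        for subset in combinations(names, k):
            cols = sorted(i for g in subset for i in groups[g])
            value[frozenset(subset)] = predictiveness(cols)

    # (c) combine with Shapley weights; the normalization uses the
    # number of groups, not the number of individual variables.
    shapley = {}
    for g in names:
        others = [n for n in names if n != g]
        total = 0.0
        for k in range(n_groups):
            weight = factorial(k) * factorial(n_groups - k - 1) / factorial(n_groups)
            for subset in combinations(others, k):
                s = frozenset(subset)
                total += weight * (value[s | {g}] - value[s])
        shapley[g] = total
    return shapley


# Toy setup: column indices for diagnoses are assumed for illustration.
groups = {"vitals": [0, 1], "labs": [2, 3, 4], "diagnoses": [5, 6]}
col_gain = [0.10, 0.05, 0.08, 0.02, 0.04, 0.12, 0.03]


def toy_predictiveness(cols):
    # Additive stand-in: each column contributes a fixed gain.
    return sum(col_gain[i] for i in cols)


result = group_shapley(groups, toy_predictiveness)
```

With three groups this evaluates predictiveness on all eight combinations listed above. Exact enumeration grows as 2 to the power of the number of groups, so it is only practical for a small number of groups.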

I don't have time for this at the moment (and I think @jjfeng probably doesn't either -- though she may have thought about it a bit), so if you want to create a PR that would be fantastic!

extend code to feature groups #5