DistrictDataLabs / yellowbrick

Visual analysis and diagnostic tools to facilitate machine learning model selection.
http://www.scikit-yb.org/
Apache License 2.0
4.29k stars 559 forks source link

MostInformativeFeatures visualizer #657

Open majidaldo opened 5 years ago

majidaldo commented 5 years ago

Is your feature request related to a problem? Please describe. This feature is motivated by the discussion in #510. The problem is how to visualize feature importance for multiple classes and/or instances. This requires solving two problems: 1. selecting the "most informative features" and 2. producing an appropriate visualization.

Describe the solution you'd like MIFV = MostInformativeFeatures MIFV().predict(X) would give a viz for the features most responsible for the prediction MIFV().poof()/fit(X) would show a viz for the features

For both cases, the proposed visual is a heat map where one axis is class labels or data and the other axis represents features.

Questions/Issues:

ndanielsen commented 5 years ago

@majidaldo thanks for creating this issue and following / linking with other discussions.

I look forward to the community thinking through the questions and issues on this. =)

rebeccabilbro commented 4 years ago

Going to go ahead and close this one out since it's gone a bit stale; to anyone with interest/bandwidth — feel free to reopen!

bbengfort commented 4 years ago

@rebeccabilbro this is one that is very interesting to me, perhaps I will take a crack at it later in the semester.

rebeccabilbro commented 4 years ago

That would be great @bbengfort, reopening!