DistrictDataLabs / yellowbrick

Visual analysis and diagnostic tools to facilitate machine learning model selection.
http://www.scikit-yb.org/
Apache License 2.0
4.26k stars 554 forks

Outlier Pruning #104

Open rebeccabilbro opened 7 years ago

rebeccabilbro commented 7 years ago

Create a visualizer that lets the user steer the outlier removal (pruning) process through visualizations.

For a single dimension, this would depict an iterative outlier pruning process, for instance computing z-scores and eliminating any point whose absolute z-score exceeds 3, then repeating on the remaining points. For multidimensional data, perhaps select the feature with the most variance and prune on that. Show how the variance of the target changes throughout the outlier removal process.
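To make the proposal concrete, here is a rough sketch of the iterative z-score pruning described above, using only NumPy. It is not yellowbrick code: the function names `iterative_zscore_prune` and `highest_variance_feature`, the `max_iter` cap, and the choice of population standard deviation are all assumptions made for illustration. The returned history is the kind of data a visualizer could plot at each step.

```python
import numpy as np


def iterative_zscore_prune(x, y, threshold=3.0, max_iter=20):
    """Repeatedly drop points of x whose |z-score| exceeds `threshold`.

    Hypothetical helper, not part of yellowbrick. Returns a boolean mask of
    the kept rows and a history of (points kept, target variance) per step,
    starting with the unpruned data.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    keep = np.ones(x.shape[0], dtype=bool)
    history = [(int(keep.sum()), float(y[keep].var()))]

    for _ in range(max_iter):
        kept = x[keep]
        std = kept.std()  # population standard deviation (assumption)
        if std == 0:
            break  # all remaining values are identical; nothing to prune
        # z-scores are recomputed from the points that survived so far
        z = np.abs(x - kept.mean()) / std
        new_keep = keep & (z <= threshold)
        if new_keep.sum() == keep.sum():
            break  # converged: no point was removed in this pass
        keep = new_keep
        history.append((int(keep.sum()), float(y[keep].var())))

    return keep, history


def highest_variance_feature(X):
    """Return the column index of the feature with the largest variance."""
    return int(np.argmax(np.asarray(X, dtype=float).var(axis=0)))
```

For multidimensional data, one could call `highest_variance_feature(X)` to pick a column and pass `X[:, col]` as `x`. Note that unscaled features make the highest-variance choice depend on units, so standardizing first may be preferable.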

bbengfort commented 7 years ago

Visualize pruning in a multi-dimensional way.