Represent categorical features in scatterplot as sized bubbles

MattJBritton / ForestfortheTrees

Interactive visualization of ensemble ML algorithms (e.g. Gradient Boosting Classifiers) for explainable ML.

GNU General Public License v3.0


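Currently, the datapoint scatterplot in the components view does not represent categorical features well. The points overplot one another and land on the axes or in the gaps between the heatmap squares. Fix this by:

A) aggregating the data cases by category and calculating a sum for each category (to drive the bubble size), and B) adjusting the bubbles' locations so they no longer fall on the axes or between the heatmap squares.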
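
A minimal sketch of steps A and B in plain Python, assuming each data case can be reduced to a (category, weight) pair and that the heatmap lays categories out as unit-width cells along one axis. The function name `aggregate_by_category`, the dictionary keys, and the unit-cell layout are illustrative assumptions and are not taken from the ForestfortheTrees code.

```python
from collections import defaultdict


def aggregate_by_category(cases, categories):
    """Step A: collapse data cases into one record per category.

    `cases` is an iterable of (category, weight) pairs. Weights are summed
    so the total can drive the bubble size. `categories` gives the axis order.
    """
    totals = defaultdict(float)
    for category, weight in cases:
        totals[category] += weight

    bubbles = []
    for index, category in enumerate(categories):
        if category not in totals:
            continue  # no data cases for this category, so no bubble
        bubbles.append({
            "category": category,
            "total": totals[category],
            # Step B: place the bubble at the center of its cell
            # (assumption: cell i spans [i, i + 1) on the axis).
            "position": index + 0.5,
        })
    return bubbles


# Hypothetical data: a weight of 1 per case makes the sum a simple count.
cases = [("red", 1), ("blue", 1), ("red", 1), ("green", 1), ("red", 1)]
bubbles = aggregate_by_category(cases, ["red", "green", "blue", "purple"])
```

With a weight of 1 per case the sum is a count of cases, which is probably the simplest quantity to map to bubble size; a different per-case weight could be passed in without changing the function.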

This requires pre-calculating the new data subsets before the call to explain(), since these distributions do not change for a given dataset and granularity. Maybe perform this calculation in build_base_model()?
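
One way the precomputation could look, sketched with hypothetical stand-ins: the class `BubbleCache` and its methods `build` and `lookup` are invented for illustration and do not mirror the real signatures of build_base_model() or explain(). The sketch also ignores granularity and only shows the idea that per-category counts are computed once per dataset and reused afterwards.

```python
from collections import Counter


class BubbleCache:
    """Hypothetical holder for per-category counts computed once per dataset."""

    def __init__(self):
        self._counts = {}
        self.build_calls = 0

    def build(self, dataset, categorical_features):
        """Count data cases per category; meant to run once at model-build time."""
        self.build_calls += 1
        for feature in categorical_features:
            self._counts[feature] = Counter(row[feature] for row in dataset)

    def lookup(self, feature):
        """Return the cached counts without touching the dataset again."""
        return self._counts[feature]


# Hypothetical dataset with two categorical features.
dataset = [
    {"color": "red", "size": "S"},
    {"color": "red", "size": "L"},
    {"color": "blue", "size": "S"},
]
cache = BubbleCache()
cache.build(dataset, ["color", "size"])  # done once, up front
first = cache.lookup("color")            # reused on every later call
second = cache.lookup("color")
```

Because `lookup` only reads from the cache, repeated explanation calls would pay no aggregation cost; the cache would need rebuilding only when the dataset (or, in the real code, the granularity) changes.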

Represent categorical features in scatterplot as sized bubbles #2