H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
There is a bug that makes the plot look odd and likely incorrect. I used a red rectangle and a red arrow to highlight the differences I noticed. The red arrow points to an issue with the empty string category: it is now split off in the plot into NAs, which have some values in the rug plot, while the empty string category has none.
It used to look like:
Note that the old version has a bug too. The empty string category is the most common one, so the histogram in the background should show that as well.
pd_plot also seems to be affected by the same issue. IIRC Zuzana refactored the code so that the common things in ICE and PDP are in one function, so it's possible it is just one bug.
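
To make the expected behavior concrete, here is a minimal sketch in plain Python of the counts I would expect the background histogram to reflect. This is not H2O code: the level_counts helper, the "NA" label, and the sample column are all made up for illustration. The only point is that the empty string is kept as its own level, separate from missing values.

```python
from collections import Counter

def level_counts(values):
    """Count category levels, keeping "" and missing (None) as separate levels."""
    counts = Counter()
    for v in values:
        # "NA" is a hypothetical label for missing values;
        # the empty string stays its own level and is never merged into it.
        key = "NA" if v is None else v
        counts[key] += 1
    return counts

# Made-up column in which the empty string is the most common level.
column = ["", "", "", "a", "b", None, "", "a"]
counts = level_counts(column)
most_common_level, most_common_count = counts.most_common(1)[0]
```

With data like this, the tallest histogram bar would belong to the empty string level, and the rug marks for those rows would sit under that level rather than under NA.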