H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Run the 10bn-row tests with the given seeds via CreateFrame, then select random rows from the result. X1 and X2 sometimes contain the same value, which looks suspicious. This didn't happen when the data was loaded from file, so something may be odd with CreateFrame or its random number generation (which is supposed to be PCG in H2O, like the generator I plugged in at C level to create the files, but I should check).
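A minimal sketch of the row-sampling check described above, assuming the two columns have already been pulled into local Python sequences. The helper name `count_equal_pairs` and the synthetic data are hypothetical stand-ins for illustration; this does not call H2O or CreateFrame, and Python's `random` module is not PCG.

```python
import random


def count_equal_pairs(x1, x2, sample_size, seed):
    """Sample row indices at random and count the rows where x1 == x2.

    Returns (number of matching rows, number of rows checked,
    sorted list of matching row indices).
    """
    if len(x1) != len(x2):
        raise ValueError("columns must have the same length")
    rng = random.Random(seed)          # seeded so the sample is reproducible
    n = len(x1)
    k = min(sample_size, n)            # never ask for more rows than exist
    idx = rng.sample(range(n), k)      # sample without replacement
    matches = sorted(i for i in idx if x1[i] == x2[i])
    return len(matches), k, matches


if __name__ == "__main__":
    # Synthetic stand-in columns (assumption: NOT produced by CreateFrame).
    gen = random.Random(42)
    x1 = [gen.random() for _ in range(10_000)]
    x2 = [gen.random() for _ in range(10_000)]

    hits, checked, rows = count_equal_pairs(x1, x2, sample_size=1_000, seed=1)
    print(f"{hits} of {checked} sampled rows have X1 == X2")
```

For two independently generated real-valued columns, exact matches should be extremely rare, so repeated hits in a modest sample would support the suspicion that the two columns share generator state.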