szilard / benchm-ml

A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
MIT License
1.87k stars 334 forks source link

Add Rborist #6

Closed eddelbuettel closed 9 years ago

eddelbuettel commented 9 years ago

Could you add Rborist in serial and parallel mode to add another (fast?) random forest implementation?

Great project. Very useful to have comparisons.

szilard commented 9 years ago

I actually tested that, but it was slower (160sec) than R's randomForest package (50sec) on the smallest 10K data even in parallel mode (it was using all 32 cores).

eddelbuettel commented 9 years ago

Thanks for the follow-up. I will ask Mark about it; he will be here for R/Finance in a few weeks.