ogrisel / pygbm

Experimental Gradient Boosting Machines in Python with numba.
MIT License

Special case the root split #6

Closed ogrisel closed 6 years ago

ogrisel commented 6 years ago

When computing the histogram for the root node split, there is no need to shuffle the binned data with sample_indices, as at the root this array is already ordered (np.arange(n_samples)).

Special-casing the histogram computation gives a 10-20% improvement for the root node split, which is also the most expensive node split for a given tree.
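To illustrate the idea, here is a minimal NumPy sketch, not the actual numba code from pygbm. The function names (build_histogram_indexed, build_histogram_root) and the choice to accumulate only a gradient sum per bin are assumptions made for this example. The general version gathers the node's samples through sample_indices, while the root version reads the arrays directly, skipping the indirection.

```python
import numpy as np


def build_histogram_indexed(binned_feature, gradients, sample_indices, n_bins=256):
    """General case (hypothetical helper): gather the node's samples first.

    The fancy indexing below copies and reorders both arrays, which is the
    extra work the issue proposes to avoid at the root.
    """
    return np.bincount(
        binned_feature[sample_indices],
        weights=gradients[sample_indices],
        minlength=n_bins,
    )


def build_histogram_root(binned_feature, gradients, n_bins=256):
    """Root case (hypothetical helper): every sample belongs to the node.

    Since sample_indices would be np.arange(n_samples), the arrays can be
    read in place with no gather step.
    """
    return np.bincount(binned_feature, weights=gradients, minlength=n_bins)


# Small demonstration on random data (values are illustrative only).
rng = np.random.default_rng(0)
n_samples = 1000
binned_feature = rng.integers(0, 256, size=n_samples).astype(np.uint8)
gradients = rng.standard_normal(n_samples)

root_indices = np.arange(n_samples)
hist_general = build_histogram_indexed(binned_feature, gradients, root_indices)
hist_root = build_histogram_root(binned_feature, gradients)
```

Both functions produce the same histogram at the root. The special case only removes the indirection, so any speedup depends on how expensive the gather is relative to the accumulation.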