
Regarding the calculation of split gain #6243

Open yuanqingye opened 10 months ago

yuanqingye commented 10 months ago

Hi, I am trying to figure out how the split gain is calculated here, since it is a key measure.

I noticed that in issue #1230 the supporter wrote: "The split gain and leaf output is calculated by sum_grad / sum_hess."

I want to know why. The split gain seems related to the way we measure impurity (Gini, entropy, etc.). In the entropy case, I remember the split gain should be H(Y) - H(Y|X), so how is that related to sum_grad / sum_hess?
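For context, here is a minimal sketch (not LightGBM's actual code) of the standard second-order "Newton" gain used in XGBoost/LightGBM-style GBDTs: the gain of a split is the drop in the regularized, Taylor-approximated loss rather than an impurity measure. The function names and the `lam` regularizer are illustrative assumptions:

```python
# Sketch of the second-order (Newton) split gain used in
# XGBoost/LightGBM-style GBDTs. Not LightGBM's actual code;
# names and the `lam` L2 regularizer are illustrative.

def leaf_score(sum_grad, sum_hess, lam=0.0):
    # Score proportional to the loss reduction from using the
    # optimal constant output -sum_grad / (sum_hess + lam) on a leaf.
    return sum_grad ** 2 / (sum_hess + lam)

def split_gain(g_left, h_left, g_right, h_right, lam=0.0):
    # Gain = score(left) + score(right) - score(parent):
    # how much the approximated loss drops by splitting.
    parent = leaf_score(g_left + g_right, h_left + h_right, lam)
    return (leaf_score(g_left, h_left, lam)
            + leaf_score(g_right, h_right, lam)
            - parent)
```

In this view the leaf output is roughly `-sum_grad / (sum_hess + lam)`, and the gain compares the children's `sum_grad^2 / sum_hess` scores against the parent's. It plays the same role that H(Y) - H(Y|X) plays for entropy, but it is derived from a second-order Taylor expansion of the loss, not from an impurity measure.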

And should it be different between the classification and regression cases? It seems that regression and classification should each have their own way of calculating it, but if the formula is the same, then the same logic could be used to measure the impurity in both cases (see the sketch below).
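To illustrate why one formula can serve both tasks, here is a hedged sketch under a common setup (squared error for regression, log loss for binary classification): only the per-sample gradients and hessians change with the objective, while the gain formula above consumes their sums and is otherwise loss-agnostic:

```python
import numpy as np

# Per-sample gradients/hessians for two common objectives.
# The split-gain formula consumes their sums, so this is the
# only part that differs between regression and classification.

def grad_hess_l2(pred, y):
    # Squared error: L = 0.5 * (pred - y)^2
    return pred - y, np.ones_like(pred)

def grad_hess_logloss(raw, y):
    # Binary log loss on raw scores; p = sigmoid(raw)
    p = 1.0 / (1.0 + np.exp(-raw))
    return p - y, p * (1.0 - p)
```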

Any material regarding this is welcome.

yuanqingye commented 10 months ago

xgboost math explanation: I think this article explains the details very well. The only big part it does not cover is shrinkage, i.e. the learning rate; a sketch of where it enters is below. I think the discussion here may also provide some insight: Discussion regarding learning rate.
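On the shrinkage point, a minimal sketch of how the learning rate typically enters a generic GBDT loop (assumed pseudocode, not LightGBM internals; `fit_one_tree` is a hypothetical helper that builds a tree on the current gradients/hessians and returns its per-sample leaf outputs): each tree's outputs are scaled by the learning rate before being added to the running prediction, which damps each step without changing how the split gain itself is computed:

```python
import numpy as np

# Sketch of shrinkage in a generic GBDT training loop.
# `grad_hess` and `fit_one_tree` are hypothetical callables.

def boost(X, y, grad_hess, fit_one_tree, n_trees=100, learning_rate=0.1):
    pred = np.zeros(len(y))
    for _ in range(n_trees):
        g, h = grad_hess(pred, y)              # gradients/hessians at current prediction
        leaf_outputs = fit_one_tree(X, g, h)   # per-sample outputs of the new tree
        pred += learning_rate * leaf_outputs   # shrinkage: scale each tree's contribution
    return pred
```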