Open adam2392 opened 1 year ago
Will be closed by: https://github.com/neurodata/scikit-learn/pull/46
A benchmarking done using cc18's openml dataset with categorical features would be nice: https://github.com/scikit-learn/scikit-learn/pull/12866#issuecomment-455350207
Basically run sklearn w/o categorical support and one-hot encoding vs w/ categorical support
compare both.
We would need to enable this in the sklearn fork's splitter. The original PR in upstream sklearn was never merged unfortunately: https://github.com/scikit-learn/scikit-learn/pull/12866.
BaseDecisionTree
and follow theHistGradientBoosting*
API patterns