Open talegari opened 5 years ago
That's a good idea. What should be the return value? We could save a list of node IDs for every observation and tree but a binary matrix for each tree (as in sklearn) could also work because node IDs are always increasing with the depth.
IMHO, sparse matrix (like sklearn) is a good choice as it might allow fast computations to process them.
Was this ever implemented ? I would be just as interested.
Hi Marvin,
I would be be great idea to provide decision path of an test observation (Sequence of node ids the observation passes through while prediction). Adding
decision_path
totype
argument inranger::predict.ranger
seems like the intuitive position for it. This is helpful in understanding the data better beyond their membership in terminal nodes. Let me know what you think.Reference: scikit-learn implementation