Closed vkostyuk closed 6 years ago
After looking at what caretEnsemble:::makePredObsMatrix
does, I think I can answer my own question. The dataset for ensemble fitting consists of the rlistbound test set predictions of the models. In particular, if the union of the test sets is not the whole dataset (as was the case in for resampling scheme), there will be fewer rows in the ensembling dataset than in the original dataset, so the same resampling scheme can't be used.
In the Brief Intro to caretEnsemble, it says
Why is this the case? I need a particular resampling scheme (specified using
index
andindexOut
) due to the nature of the data, and I would like to use the same scheme for trainig the ensemble.