Hi @AlineTalhouk Using RankAggreg, I am able to get a list of the top performing algorithm for each bootstrapped resample. For example, if n.boot = 3, then we might get something like "svm", "svm", "rf". The paper describes a majority vote approach for the ensemble classifier in a new sample X (independent data). So I'm wondering what we want to output for the splendid() function.
Currently, I implemented the majority voting for ensemble class prediction on the entire, original dataset. But certainly there is overfitting there.
Hi @AlineTalhouk Using RankAggreg, I am able to get a list of the top performing algorithm for each bootstrapped resample. For example, if n.boot = 3, then we might get something like
"svm", "svm", "rf"
. The paper describes a majority vote approach for the ensemble classifier in a new sample X (independent data). So I'm wondering what we want to output for thesplendid()
function.Currently, I implemented the majority voting for ensemble class prediction on the entire, original dataset. But certainly there is overfitting there.