Why PipeOpTuneThreshold reduce the classif.ce?

Hey, good catch!

This can happen due to various reasons, the most likely one, in this case, being that we train a "cross-validated" learner, so each model only gets to see 2/3 of the training data (in comparison to the original one which gets to see all training data). For smaller datasets, such as the ones in our example, this sometimes leads to poorer results. Other reasons can include just pure randomness/overfitting if the differences between the default threshold (0.5) and the tuned threshold are small.

I have added a different dataset as well as a seed and some more explanations in #108 .

mlr-org / mlr3gallery

Why PipeOpTuneThreshold reduce the classif.ce? #106