geco-bern / agds

Applied Geodata Science book. Developed for the lecture(s) with the same name at the Institute of Geography, University of Bern.
https://geco-bern.github.io/agds/

Inconsistent use of cross-validation #94

Closed by padasch 1 year ago

padasch commented 1 year ago

As discussed on Monday, cross-validation gives an error estimate for a model trained on all available data. Therefore, the all-data-trained model is used as the final model, not one of the models created in the cross-validation routine.

This figure from the Google crash course suggests using the best model from within the cross-validation routine, which is inconsistent with the point above.

https://github.com/geco-bern/agds/blob/7c71dd3a9a9d6aac2c684a3eeec0967a071d3a46/10-supervised_ml_II.Rmd#L199
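To make the distinction concrete, here is a minimal sketch of the workflow described above: the fold models only produce the error estimate, and the final model is refit on all data. It is a plain-Python stand-in and not the book's R code. The synthetic data and the helper names `fit_line`, `rmse`, and `cross_validate` are made up for illustration.

```python
import random
import statistics


def fit_line(xs, ys):
    """Ordinary least squares for y = a + b * x. Returns (a, b)."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return my - b * mx, b


def rmse(model, xs, ys):
    """Root mean squared error of a fitted (a, b) model on the given data."""
    a, b = model
    return (sum((a + b * x - y) ** 2 for x, y in zip(xs, ys)) / len(xs)) ** 0.5


def cross_validate(xs, ys, k=5, seed=0):
    """Return one validation RMSE per fold. The fold models are discarded."""
    idx = list(range(len(xs)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    scores = []
    for fold in folds:
        held_out = set(fold)
        train = [i for i in idx if i not in held_out]
        model = fit_line([xs[i] for i in train], [ys[i] for i in train])
        scores.append(rmse(model, [xs[i] for i in fold], [ys[i] for i in fold]))
    return scores


# Hypothetical synthetic data: y = 2 + 0.5 * x plus Gaussian noise.
rng = random.Random(42)
xs = [i / 10 for i in range(100)]
ys = [2.0 + 0.5 * x + rng.gauss(0, 0.3) for x in xs]

# Step 1: cross-validation yields an estimate of the generalisation error.
fold_rmse = cross_validate(xs, ys, k=5)
cv_rmse = statistics.fmean(fold_rmse)

# Step 2: the final model is trained on all available data,
# not picked from among the k fold models.
final_model = fit_line(xs, ys)

print(f"CV RMSE estimate: {cv_rmse:.3f}")
print(f"Final model: intercept={final_model[0]:.3f}, slope={final_model[1]:.3f}")
```

Note that `cross_validate` returns only scores, so there is no "best fold model" to pick from. Picking the fold model with the lowest validation error would also make the reported error optimistic for that model.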