Split validation data by lake

DOI-USGS / lake-temperature-lstm-static

Predict lake temperatures at depth using static lake attributes

Other

0 stars 3 forks source link

Split validation data by lake #31

Closed AndyMcAliley closed 2 years ago

AndyMcAliley commented 2 years ago

Test data is split by lake, but validation data is not. This can lead to overfitting, since the validation data used to determine early stopping criteria is taken from the same lakes as the training data. This split should happen during 2_process at the same time that the test set is formed, so we can remove the training/validation splitting that happens in 3_train.

AndyMcAliley commented 2 years ago

Closed by #37