According to https://mcfly.readthedocs.io/en/latest/reference.html?highlight=early#mcfly.train_models_on_samples,
Unless ‘None’ early Stopping is used for the model training. Set to integer to define how many epochs without improvement to wait for before stopping. Default is ‘auto’ in which case the patience will be set to number of epochs/10 (and not bigger than 5).
This means that if we select a number of epochs below 10, by default, any failure to improve within 1 epoch will trigger early stopping.
As a default, this is too aggressive. Alternatively, we don't do early stopping if the total number of epochs is below 10.
All of the above models are decreasing their loss on the validation set, but did not have the time yet to switch to any better strategy than choosing the majority class in the train set.
According to https://mcfly.readthedocs.io/en/latest/reference.html?highlight=early#mcfly.train_models_on_samples,
Unless ‘None’ early Stopping is used for the model training. Set to integer to define how many epochs without improvement to wait for before stopping. Default is ‘auto’ in which case the patience will be set to number of epochs/10 (and not bigger than 5).
This means that if we select a number of epochs below 10, by default, any failure to improve within 1 epoch will trigger early stopping. As a default, this is too aggressive. Alternatively, we don't do early stopping if the total number of epochs is below 10.To illustrate see:
All of the above models are decreasing their loss on the validation set, but did not have the time yet to switch to any better strategy than choosing the majority class in the train set.