baal-org / baal

Bayesian active learning library for research and industrial use cases.
https://baal.readthedocs.io
Apache License 2.0

Support for early stopping in `ModelWrapper.train_on_dataset()` #261

Closed: arthur-thuy closed 1 year ago

arthur-thuy commented 1 year ago

Is your feature request related to a problem? Please describe. In the ModelWrapper.train_on_dataset() function, the number of epochs to train for must be specified. When tuning training for convergence, it is hard to settle on a single value because the amount of labelled training data grows throughout the active learning process.

Describe the solution you'd like Support early stopping in the ModelWrapper.train_on_dataset() function, interrupting training once the validation loss stops decreasing.

Describe alternatives you've considered Similar to the Model.fit() function in Keras, the ModelWrapper.train_on_dataset() function could take callbacks and validation_data arguments (see the hypothetical sketch at the end of this comment).

Additional context Note that early stopping at this level is different from early stopping at the level of the active learning process, which stops labelling new instances when the current labelled set contains all the information necessary.
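For illustration only, here is a purely hypothetical sketch of what the requested interface could look like. The validation_data and callbacks arguments and the EarlyStopping callback are not part of Baal's current API; they only mirror the Keras-style behaviour described above.

```python
# Hypothetical interface only: `validation_data`, `callbacks`, and
# `EarlyStopping` are NOT part of Baal's current API. They illustrate the
# Keras-style early stopping requested above.
wrapper.train_on_dataset(
    active_set,                                  # labelled pool, grows each AL step
    optimizer,
    batch_size=32,
    epoch=100,                                   # upper bound only
    use_cuda=True,
    validation_data=val_set,                     # hypothetical argument
    callbacks=[EarlyStopping(monitor="val_loss", patience=5)],  # hypothetical
)
```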

arthur-thuy commented 1 year ago

My apologies, I now see that the ModelWrapper.train_and_test_on_datasets() function already supports early stopping through the patience and min_epoch_for_es arguments. I missed it at first because neither this function nor the early-stopping functionality is used in any of the examples.
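For reference, a minimal sketch of how that could be called. This assumes the pre-2.0 style ModelWrapper (built from a model and a criterion), and train_set / val_set are placeholder torch Datasets. Only patience and min_epoch_for_es are taken from the comment above; the other argument names follow my reading of that API and may differ between Baal versions.

```python
import torch
from torch import nn, optim
from baal.modelwrapper import ModelWrapper

# Assumed pre-2.0 style wrapper: model + criterion.
model = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 10))
wrapper = ModelWrapper(model, nn.CrossEntropyLoss())
optimizer = optim.SGD(model.parameters(), lr=0.01)

# `train_set` / `val_set` are placeholder torch Datasets (not defined here).
wrapper.train_and_test_on_datasets(
    train_set,                      # current labelled pool
    val_set,                        # held-out set monitored for early stopping
    optimizer,
    batch_size=32,
    epoch=100,                      # upper bound; training may stop earlier
    use_cuda=torch.cuda.is_available(),
    patience=5,                     # stop after 5 epochs without improvement
    min_epoch_for_es=10,            # do not trigger early stopping before epoch 10
)
```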

I do have a follow-up request about using a validation set for early stopping; I will open a new issue.