heal-research / pyoperon

Python bindings and scikit-learn interface for the Operon library for symbolic regression.
MIT License

Feature request: Validation data metrics for model selection #19

Open romanovzky opened 3 weeks ago

romanovzky commented 3 weeks ago

Currently, SymbolicRegressor returns the model that best satisfies a given criterion. This criterion, however, is computed on the training set. Machine learning best practice dictates that model selection should be done on a validation set. At the moment this can be "hacked" by selecting the best Pareto-front individual against a validation metric after the SymbolicRegressor completes its run. With callbacks (see https://github.com/heal-research/pyoperon/issues/18), this feature could also enable early-stopping criteria based on the validation set, which is common in machine learning packages with iterative training (see Keras, Lightning, XGBoost, etc. for examples).
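For illustration, the post-hoc "hack" described above might look like the following sketch. The candidate models here are plain callables standing in for Pareto-front individuals; pyoperon's actual front representation and evaluation API are not assumed.

```python
# Sketch of post-hoc model selection on a validation set: after a symbolic
# regression run, re-score every Pareto-front candidate on held-out data and
# keep the one with the lowest validation error. Stand-in models, not pyoperon
# objects.

def mse(y_true, y_pred):
    """Mean squared error over paired sequences."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def select_on_validation(pareto_front, X_val, y_val):
    """Return (validation_mse, model) for the best candidate."""
    scored = [(mse(y_val, [model(x) for x in X_val]), model)
              for model in pareto_front]
    return min(scored, key=lambda s: s[0])

# Hypothetical Pareto front: candidates of increasing complexity.
front = [
    lambda x: 2.0,               # constant model
    lambda x: 1.9 * x,           # near-correct linear model
    lambda x: 2.0 * x + x ** 3,  # over-complex model
]
X_val = [0.0, 1.0, 2.0]
y_val = [0.0, 2.0, 4.0]  # true relation: y = 2x

best_err, best_model = select_on_validation(front, X_val, y_val)
```

With a callback mechanism, the same scoring could run during evolution instead of only after the fact, enabling early stopping when the validation error stops improving.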

gkronber commented 3 weeks ago

I like the idea of using the callback mechanism for this, so that users have different options for model selection. Selecting based on a validation set could be a good default. Other options are selection based on criteria such as Bayesian evidence, AIC, BIC or description length, but these could be easily added by users once the callback mechanism is in place.
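One of the user-supplied criteria mentioned above could be sketched as follows, assuming Gaussian residuals so that AIC and BIC reduce to closed forms in the residual sum of squares; the complexity values and candidate numbers are made up for illustration.

```python
import math

# Sketch of information-criterion model selection (AIC/BIC) as an alternative
# to a validation set. Under a Gaussian error model with n samples, RSS the
# residual sum of squares, and k a complexity measure (e.g. expression size):
#   AIC = n * ln(RSS / n) + 2 * k
#   BIC = n * ln(RSS / n) + k * ln(n)

def aic(rss, n, k):
    return n * math.log(rss / n) + 2 * k

def bic(rss, n, k):
    return n * math.log(rss / n) + k * math.log(n)

# Hypothetical candidates: (training RSS, complexity).
candidates = [
    (5.0, 2),   # simple but poor fit
    (1.2, 5),   # good fit, moderate complexity
    (1.1, 12),  # marginally better fit, much more complex
]
n = 50  # number of training samples

best_aic = min(candidates, key=lambda c: aic(c[0], n, c[1]))
best_bic = min(candidates, key=lambda c: bic(c[0], n, c[1]))
```

Both criteria penalize the most complex candidate despite its slightly lower training error, which is exactly the trade-off a callback-based selection hook would let users plug in.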