trevorstephens / gplearn

Genetic Programming in Python, with a scikit-learn inspired API
http://gplearn.readthedocs.io/
BSD 3-Clause "New" or "Revised" License
1.56k stars 274 forks source link

Optimal Population Size #291

Closed LACORTEZ33 closed 1 year ago

LACORTEZ33 commented 1 year ago

For a relatively small datasets we are trying to find y=f(x) We realized the results quality are highly dependent on the number of programs (population size).
The optimal number of programs seems to be a function of number of data points and overall correlation between x and y. Any hint to define the optimal number of programs?

Thanks

trevorstephens commented 1 year ago

Sorry but every problem is different and I cannot advise on individual datasets. There is general advice in the docs