Consider non-stationary Bandit algorithm for adaptive mutations

aimclub / GOLEM

Graph Optimiser for Learning and Evolution of Models

https://thegolem.readthedocs.io

BSD 3-Clause "New" or "Revised" License

63 stars 7 forks source link

Consider non-stationary Bandit algorithm for adaptive mutations #159

Closed gkirgizov closed 11 months ago

gkirgizov commented 1 year ago

Distribution of which mutation is best is in general non-stationary during optimization process. So we need considering appropriate algorithms, because default bandit algorithms assume stationary distribution p(reward|action).