Grid2op / l2rpn-baselines

L2RPN Baselines: a repository to host baselines for L2RPN competitions.
https://l2rpn-baselines.readthedocs.io/en/stable/
Mozilla Public License 2.0

Tricks for fine-tuning the hyperparameters #32

Open liushunyu opened 3 years ago

liushunyu commented 3 years ago

Hello! I am trying to apply other reinforcement learning methods to the l2rpn problem, but I find that my results cannot match the performance of the DQN implementation in l2rpn-baselines.

Even when I use another library to implement the DQN, changing the hyperparameters only slightly means I no longer get a reasonable result.

So I want to know how you fine-tuned the hyperparameters of the DQN for the l2rpn problem. Are there any tricks?

Tezirg commented 3 years ago

Hello,

Fine-tuning NN hyperparameters is a whole research area, and there is no definitive answer. For the DQN, it was done by trial and error, alongside rigorous data tracking in both spreadsheets and TensorBoard. A good starting point is also to use the hyperparameters from the reference literature.
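For concreteness, the hyperparameters most often borrowed from the reference literature are the original Atari DQN settings (Mnih et al., 2015). The sketch below lists those published values as a starting point before task-specific tuning; it is an illustration, not the configuration actually used by l2rpn-baselines.

```python
# Reference DQN hyperparameters from Mnih et al. (2015), commonly used as a
# starting point before task-specific tuning. These are the published Atari
# values, NOT the values used in l2rpn-baselines.
DQN_REFERENCE_HPARAMS = {
    "learning_rate": 2.5e-4,       # RMSProp step size
    "discount_factor": 0.99,       # gamma
    "replay_buffer_size": 1_000_000,
    "batch_size": 32,
    "target_update_freq": 10_000,  # steps between target-network syncs
    "epsilon_start": 1.0,          # initial exploration rate
    "epsilon_final": 0.1,          # final exploration rate
    "epsilon_decay_steps": 1_000_000,
}

def epsilon_at(step, h=DQN_REFERENCE_HPARAMS):
    """Linearly annealed epsilon for epsilon-greedy exploration."""
    frac = min(step / h["epsilon_decay_steps"], 1.0)
    return h["epsilon_start"] + frac * (h["epsilon_final"] - h["epsilon_start"])
```

When transferring these to a new domain like L2RPN, the exploration schedule and target-update frequency are usually the first things that need re-tuning, since episode lengths and reward scales differ from Atari.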

Finally, I would consider the DQN more as an example of how to get started with an L2RPN + NN approach than as a genuinely well-performing baseline. This is because the L2RPN challenge is really about coping with the increasing complexity as the grid size grows from 10 to 100, 1,000, and 10,000 nodes, which a DQN cannot handle with its 1:1 mapping of outputs to actions.
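To illustrate the scaling issue: in grid2op's 2-busbar model, a substation with k connected elements has on the order of 2^(k-1) distinct bus assignments (the factor 2 symmetry between busbars removed), so the number of discrete topology actions, and hence the DQN's output layer width, grows quickly with grid size. A back-of-the-envelope sketch, assuming a uniform element count per substation (a simplification; real grids vary, and legality constraints prune some actions):

```python
def topology_action_count(n_substations, elements_per_sub=5):
    """Rough upper bound on unitary bus-split topology actions in a
    2-busbar grid model. Each substation with k elements has about
    2**(k-1) bus assignments (busbar symmetry removed). Ignores
    per-action legality constraints, so this is an upper bound."""
    per_substation = 2 ** (elements_per_sub - 1)
    return n_substations * per_substation

# Output-layer size a plain DQN would need at the grid sizes mentioned above:
for n in (10, 100, 1000, 10000):
    print(n, "substations ->", topology_action_count(n), "actions")
```

Even this crude count shows the action space (and thus the Q-network's output head) growing linearly in substations and exponentially in elements per substation, which is why approaches that factor or embed the action space tend to replace a flat DQN head at scale.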

Best