Closed RuanJohn closed 8 months ago
What?
Add the option to linearly decay the actor and critic learning rates during training.
Why?
Based on the PPO implementation details blog post here.
How?
Adds a simple utils file that creates a linear learning rate decay schedule and passes it to the optimisers.
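As a rough illustration of the idea (not the code added in this PR), a linear decay helper could look like the sketch below. The function name `make_learning_rate`, its arguments, and the example learning rates are hypothetical. The sketch assumes the optimiser accepts either a plain float or a callable that maps the update step to a learning rate, and that the step counter counts whole training updates.

```python
from typing import Callable, Union

# A schedule maps the current update step to a learning rate.
Schedule = Callable[[int], float]


def make_learning_rate(
    init_lr: float, num_updates: int, decay_learning_rates: bool
) -> Union[float, Schedule]:
    """Return a constant learning rate or a linear decay schedule (hypothetical helper)."""
    if not decay_learning_rates:
        # Default behaviour: keep the learning rate constant.
        return init_lr

    def linear_decay(update_step: int) -> float:
        # Clip the step to [0, num_updates] so the rate never goes negative.
        clipped = min(max(update_step, 0), num_updates)
        # Fraction of training that remains: 1.0 at the start, 0.0 at the end.
        frac = 1.0 - clipped / num_updates
        return init_lr * frac

    return linear_decay


# Illustrative values only: decay the actor rate, keep the critic rate constant.
actor_lr = make_learning_rate(3e-4, num_updates=1000, decay_learning_rates=True)
critic_lr = make_learning_rate(1e-3, num_updates=1000, decay_learning_rates=False)
```

If the optimiser increments its step counter once per minibatch rather than once per training update, the step would need to be divided by the number of epochs and minibatches per update before it is passed to the schedule.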
Extra
The default behaviour (constant learning rates) is retained by setting decay_learning_rates: False in the system configs.