Closed RPegoud closed 11 months ago
Training DQN on CartPole is currently quite unstable, probably because epsilon is held at a constant value. Add epsilon annealing and test whether it improves performance.
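For reference, a minimal sketch of what epsilon annealing could look like, assuming a linear schedule. The function name `linear_epsilon` and the defaults (`eps_start`, `eps_end`, `decay_steps`) are hypothetical and not taken from this repository; an exponential schedule would be an equally valid choice.

```python
def linear_epsilon(
    step: int,
    eps_start: float = 1.0,     # assumed initial exploration rate
    eps_end: float = 0.05,      # assumed final exploration rate
    decay_steps: int = 10_000,  # assumed number of steps over which to anneal
) -> float:
    """Linearly anneal epsilon from eps_start to eps_end, then hold at eps_end."""
    # Fraction of the annealing period that has elapsed, clipped to [0, 1].
    fraction = min(max(step, 0) / decay_steps, 1.0)
    return eps_start + fraction * (eps_end - eps_start)


if __name__ == "__main__":
    # Print epsilon at a few points of the schedule.
    for step in (0, 2_500, 5_000, 10_000, 20_000):
        print(step, round(linear_epsilon(step), 4))
```

The agent would call this once per environment step and pass the result to its epsilon-greedy action selection, so exploration is high early in training and low once the Q-values become more reliable.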