Open taku-y opened 3 years ago
Following the paper of DQN Atari, epsilon greedy policy with e=0.05 should be used in evaluation (Evaluation procedure, METHODS).
Following the paper of DQN Atari, epsilon greedy policy with e=0.05 should be used in evaluation (Evaluation procedure, METHODS).