keiohta / tf2rl

TensorFlow2 Reinforcement Learning
MIT License
467 stars 103 forks

Implement ApeX DQN #11

Open keiohta opened 5 years ago

keiohta commented 5 years ago
  • [x] Change noise level of each explorer

https://github.com/keiohta/tf2rl/commit/edc06fba1f85ae010d7e838f4f7d073c8ff42e60
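For context, the Ape-X paper (Horgan et al., 2018) gives each actor a fixed epsilon of the form eps_i = eps^(1 + alpha * i / (N - 1)), with eps = 0.4 and alpha = 7. The sketch below shows that schedule only; the linked commit may implement the noise levels differently, and `explorer_epsilons` is a hypothetical helper name, not a tf2rl function.

```python
def explorer_epsilons(n_explorers, base_eps=0.4, alpha=7.0):
    """Per-explorer epsilon following the Ape-X paper schedule.

    eps_i = base_eps ** (1 + alpha * i / (n_explorers - 1))
    The defaults (0.4, 7.0) are the constants reported in the paper,
    not values read from tf2rl.
    """
    if n_explorers < 1:
        raise ValueError("n_explorers must be at least 1")
    if n_explorers == 1:
        # A single explorer has no range to spread over; use the base value.
        return [base_eps]
    return [
        base_eps ** (1.0 + alpha * i / (n_explorers - 1))
        for i in range(n_explorers)
    ]
```

With several explorers, the first one explores most (epsilon 0.4) and the last one is close to greedy (epsilon 0.4^8), so the replay buffer receives experience from a range of exploration levels.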

keiohta commented 5 years ago

Decided to plot the learner and the evaluator on two separate plots at commit dfc0c106a37866b87e8341a89ced9bf617ad7c39, for the following reasons:

keiohta commented 5 years ago

Waiting for the cpprb implementation of NStepReplayBuffer.
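To illustrate what an N-step buffer needs to produce, here is a minimal pure-Python sketch of folding one episode into N-step transitions. It is not cpprb's API; the function name `n_step_transitions` and the tuple layout `(obs, act, rew, next_obs, done)` are assumptions made for this example.

```python
def n_step_transitions(episode, n_step=3, gamma=0.99):
    """Convert 1-step transitions into N-step transitions.

    `episode` is a list of (obs, act, rew, next_obs, done) tuples in order.
    Each output tuple is (obs, act, n_step_return, next_obs, done, discount),
    where `discount` is gamma ** (number of steps actually accumulated), i.e.
    the factor to apply to the bootstrapped value of `next_obs`.
    """
    out = []
    for t in range(len(episode)):
        obs, act = episode[t][0], episode[t][1]
        ret, discount = 0.0, 1.0
        # Accumulate up to n_step rewards, stopping early at episode end.
        for k in range(t, min(t + n_step, len(episode))):
            _, _, rew, next_obs, done = episode[k]
            ret += discount * rew
            discount *= gamma
            if done:
                break
        out.append((obs, act, ret, next_obs, done, discount))
    return out
```

A learner would then form the target as `n_step_return + (1 - done) * discount * max_a Q(next_obs, a)`, which reduces to the usual 1-step target when `n_step=1`.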