Implement ApeX DQN

keiohta / tf2rl

TensorFlow2 Reinforcement Learning

MIT License

467 stars 103 forks

Open keiohta opened 5 years ago

keiohta commented 5 years ago

Distributed Prioritized Experience Replay
The current implementation works only for DDPG variants, so extend it to work with DQN-like agents.

keiohta commented 5 years ago

[x] Change noise level of each explorer
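For reference, a minimal sketch of how a per-explorer noise level could be assigned. It assumes the epsilon-greedy schedule from the Ape-X paper, eps_i = eps^(1 + alpha * i / (N - 1)) with eps = 0.4 and alpha = 7. The function name `explorer_epsilons` is hypothetical, and this thread does not confirm that tf2rl uses exactly this schedule.

```python
def explorer_epsilons(n_explorers, base_eps=0.4, alpha=7.0):
    """Return one epsilon per explorer (hypothetical helper, not tf2rl API).

    Assumes the Ape-X paper's schedule:
        eps_i = base_eps ** (1 + alpha * i / (N - 1))
    so explorer 0 explores the most and explorer N-1 the least.
    """
    if n_explorers < 1:
        raise ValueError("n_explorers must be >= 1")
    if n_explorers == 1:
        # The schedule is undefined for N == 1; fall back to the base value.
        return [base_eps]
    return [
        base_eps ** (1.0 + alpha * i / (n_explorers - 1))
        for i in range(n_explorers)
    ]
```

Each explorer process would then pick its own entry by index, e.g. `explorer_epsilons(n_explorers)[explorer_id]`.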

keiohta commented 5 years ago

Decided to plot the learner and the evaluator in two separate plots at commit dfc0c106a37866b87e8341a89ced9bf617ad7c39, for the following reasons:

TensorBoard does not seem to run with more than two files in the same directory
Avoid sharing tf.contrib.summary: it is called often from both the learner and the evaluator, which might result in a significant bottleneck

keiohta commented 5 years ago

Waiting for cpprb implementation of NStepReplayBuffer.
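To make the dependency concrete, here is a rough sketch of what an n-step buffer has to do before storing a transition: fold the next n rewards into one discounted return and pair the first observation with the observation n steps later. `NStepAccumulator` and its methods are hypothetical names for illustration only; this is not the cpprb API, and the eventual cpprb implementation may work differently.

```python
from collections import deque


class NStepAccumulator:
    """Hypothetical helper that turns 1-step transitions into n-step ones."""

    def __init__(self, n_step=3, gamma=0.99):
        self.n_step = n_step
        self.gamma = gamma
        self._window = deque()  # pending 1-step transitions, oldest first

    def _fold(self):
        # Discounted sum of all rewards currently in the window.
        obs, act = self._window[0][0], self._window[0][1]
        ret = sum(
            (self.gamma ** k) * step[2] for k, step in enumerate(self._window)
        )
        # Bootstrap target state and done flag come from the newest step.
        next_obs, done = self._window[-1][3], self._window[-1][4]
        return obs, act, ret, next_obs, done

    def add(self, obs, act, rew, next_obs, done):
        """Add a 1-step transition; return the n-step transitions now ready."""
        self._window.append((obs, act, rew, next_obs, done))
        ready = []
        if done:
            # Episode ended: flush every pending start state with a
            # truncated return so nothing leaks into the next episode.
            while self._window:
                ready.append(self._fold())
                self._window.popleft()
        elif len(self._window) == self.n_step:
            ready.append(self._fold())
            self._window.popleft()
        return ready
```

The transitions returned by `add` would be what gets pushed into the replay buffer. For non-terminal transitions the learner would then bootstrap with `gamma ** n_step` instead of `gamma`.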