keiohta / tf2rl

TensorFlow2 Reinforcement Learning
MIT License
464 stars 104 forks source link

Reproduce ApeX paper results for continuous action #22

Open keiohta opened 5 years ago

keiohta commented 5 years ago

Distributed Prioritized Experience Replay Hyper parameters are shown in appendix D

keiohta commented 5 years ago
keiohta commented 5 years ago

Waiting for cpprb implementation of NStepReplayBuffer.