Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some explanation
GNU General Public License v3.0
47
stars
5
forks
source link
RND_epochs #11
Closed
pavelxx1 closed 3 years ago
Hi can I use for training more then 5 rnd_epoch? or it's only const? thx