How can I use the "ppo_rnd_tensorflow.py " to train BipedalWalker and LunarLander

wisnunugroho21 / reinforcement_learning_ppo_rnd

Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some explanation

GNU General Public License v3.0

47 stars 5 forks source link

**Is your feature request related to a problem? Please describe.** I really appreciate your coding because it helped me a lot.

I am just a beginner of RL and I wonder if I can use ppo_rnd_tensorflow.py to train BipedalWalker and LunderLander by filling some gaps about the environment.

But I am in China now, and it is really slow to download your codes because of some bad effects of the COVID-19. So I wonder if you have tried it before? I have noticed that your Results file where you have noted that they are the results(NON-RND)

Describe the solution you'd like I am really looking forward to your reply or whether you think it is reasonable or not? If so, I will try then after my community network recovers from the COVID-19.

Additional context Thank you a lot!

wisnunugroho21 / reinforcement_learning_ppo_rnd

How can I use the "ppo_rnd_tensorflow.py " to train BipedalWalker and LunarLander #4