wisnunugroho21 / reinforcement_learning_ppo_rnd

Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some explanation
GNU General Public License v3.0
47 stars 5 forks source link

How can I use the "ppo_rnd_tensorflow.py " to train BipedalWalker and LunarLander #4

Open suyunjiephoebe opened 4 years ago

suyunjiephoebe commented 4 years ago

**Is your feature request related to a problem? Please describe.** I really appreciate your coding because it helped me a lot.

I am just a beginner of RL and I wonder if I can use ppo_rnd_tensorflow.py to train BipedalWalker and LunderLander by filling some gaps about the environment.

But I am in China now, and it is really slow to download your codes because of some bad effects of the COVID-19. So I wonder if you have tried it before? I have noticed that your Results file where you have noted that they are the results(NON-RND)

Describe the solution you'd like I am really looking forward to your reply or whether you think it is reasonable or not? If so, I will try then after my community network recovers from the COVID-19.

Additional context Thank you a lot!

wisnunugroho21 commented 4 years ago

I have not yet created continuous version for PPO RnD. So, the code only work for discrete env. I will write for continuous version

for LunarLander, just change env_name = "Env Name" to env_name = "LunarLander-v2"

Sorrry for late reply