Khrylx / PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
MIT License
1.09k stars 186 forks source link

Mountain Car #24

Open jpark0315 opened 4 years ago

jpark0315 commented 4 years ago

Thanks for open sourcing this, this is very good stuff.

However, the code doesn't seem to work on mountain car env? Maybe is it because I have only 2600 expert state/action pairs?