Khrylx / PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
MIT License
1.09k stars 186 forks

result is not good #7

Closed: ghost closed 6 years ago

Khrylx commented 6 years ago

Hi there,

Could you please paste the commands you ran and all the output after 1000 iterations? The term “not good” is pretty vague, and I don’t know how to debug this issue without more detail.

ghost commented 6 years ago

Yes, the result is good. Sorry for the delayed reply. Thanks very much.