
Khrylx / PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

MIT License

1.09k stars, 186 forks

question about A2C #17

Open kishanpb opened 4 years ago

kishanpb commented 4 years ago

Did you try training an A2C agent on the Swimmer environment? I was not able to train it! I tested many NN parameters, but was unsuccessful.