Khrylx / PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
MIT License
1.09k stars, 186 forks

Does this repository only work with Gym environments? #29

Open XueminLiu111 opened 3 years ago

XueminLiu111 commented 3 years ago

What if we have a custom environment whose action values are bounded? Do you use a function such as tanh to squash the Gaussian samples into a valid action range?
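
To make the question concrete, here is a minimal sketch of the kind of tanh squashing being asked about. It is not taken from this repository, and whether the repository does anything like it is not stated here. The function name `sample_bounded_action` and its parameters (`mean`, `log_std`, `low`, `high`, `rng`) are hypothetical, chosen only for illustration. It draws an unbounded Gaussian sample, squashes it with tanh, rescales it to the action bounds, and applies the change-of-variables correction to the log-probability that a policy gradient method would need.

```python
import math
import random


def sample_bounded_action(mean, log_std, low, high, rng=random):
    """Sample u ~ N(mean, std^2), squash with tanh, rescale to [low, high].

    Hypothetical helper for illustration; not part of PyTorch-RL.
    Returns the bounded action and its log-density.
    """
    std = math.exp(log_std)
    u = rng.gauss(mean, std)        # unbounded Gaussian sample
    squashed = math.tanh(u)         # now in [-1, 1]
    action = low + 0.5 * (squashed + 1.0) * (high - low)

    # Log-density of u under the Gaussian.
    log_prob_u = (
        -0.5 * ((u - mean) / std) ** 2
        - log_std
        - 0.5 * math.log(2.0 * math.pi)
    )
    # Change of variables: da/du = 0.5 * (high - low) * (1 - tanh(u)^2).
    # The small epsilon guards against log(0) when tanh saturates.
    log_det = math.log(0.5 * (high - low)) + math.log(1.0 - squashed ** 2 + 1e-6)
    return action, log_prob_u - log_det


if __name__ == "__main__":
    rng = random.Random(0)
    action, log_prob = sample_bounded_action(0.0, -0.5, low=-2.0, high=3.0, rng=rng)
    print(action, log_prob)
```

A common alternative is to leave the policy unbounded and clip the action inside the environment's `step` function. That is simpler, but the policy then never sees the effect of the clipping.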