ikostrikov / pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
MIT License
3.53k stars 832 forks source link

What can compute_grad_pen in gail.py do? #250

Open ruleGreen opened 3 years ago

ruleGreen commented 3 years ago

Hello, I am a little confused about compute_grad_pen function in gail.py, Cound someone can tell me? why we need this in this file? and what relation between this with GAIL?