Khrylx / PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
MIT License
1.09k stars, 186 forks

About computing Hessian*vector #32

Open jjjhfffjj opened 2 years ago

jjjhfffjj commented 2 years ago

Excuse me, in the TRPO code there is the following definition: `def Fvp_direct(v):`. What is the input `v`, and how is it obtained? Thanks for your help!
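For context, a minimal sketch of where such a `v` usually comes from. In typical TRPO implementations the vector is not supplied by the user: the conjugate gradient solver generates it, calling the matrix-vector product function once per iteration with its current search direction. The sketch below is plain Python and rests on several assumptions. The 2x2 matrix `F` is a stand-in for the Fisher matrix or KL Hessian, `b` stands in for the negative policy-gradient vector, and the names `fvp`, `conjugate_gradient`, and `seen_directions` are hypothetical, not taken from the repository. A real implementation would compute the product through autograd rather than from an explicit matrix.

```python
def dot(a, b):
    """Inner product of two equal-length lists."""
    return sum(x * y for x, y in zip(a, b))


# Assumption: a small symmetric positive-definite matrix standing in for
# the Fisher matrix / KL Hessian, which TRPO never builds explicitly.
F = [[4.0, 1.0],
     [1.0, 3.0]]

seen_directions = []  # records every `v` the solver passes in


def fvp(v):
    """Return F @ v. In TRPO this product would come from autograd."""
    seen_directions.append(list(v))
    return [dot(row, v) for row in F]


def conjugate_gradient(avp, b, n_steps=10, tol=1e-10):
    """Approximately solve F x = b using only products avp(v) = F @ v."""
    x = [0.0] * len(b)
    r = list(b)  # residual b - F x, with x starting at zero
    p = list(b)  # first search direction is b itself
    r_dot_r = dot(r, r)
    for _ in range(n_steps):
        fp = avp(p)  # the solver's search direction p is the `v` argument
        alpha = r_dot_r / dot(p, fp)
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * fi for ri, fi in zip(r, fp)]
        new_r_dot_r = dot(r, r)
        if new_r_dot_r < tol:
            break
        beta = new_r_dot_r / r_dot_r
        p = [ri + beta * pi for ri, pi in zip(r, p)]
        r_dot_r = new_r_dot_r
    return x


b = [1.0, 2.0]  # assumption: stands in for the negative loss gradient
solution = conjugate_gradient(fvp, b)
print("step direction:", solution)
print("vectors passed as v:", seen_directions)
```

In this sketch the first `v` is `b` itself, and each later `v` is the updated search direction, so the caller only ever provides the product function and the right-hand side.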