About computing Hessian*vector

Khrylx / PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

MIT License

1.09k stars 186 forks source link

About computing Hessian*vector #32

Open jjjhfffjj opened 2 years ago

jjjhfffjj commented 2 years ago

Excuse me,in TRPO's code，In the following definitions： _def Fvpdirect(v):，What is the input v？and how to get it? thanks for your help!