long8v / PTIR

Paper Today I Read
19 stars 0 forks source link

[142] Trust Region Policy Optimization #154

Open long8v opened 6 months ago

long8v commented 6 months ago
image

paper

TL;DR

Details

아래 pptx에 정리

long8v commented 6 months ago

TRPO.pptx

long8v commented 6 months ago

https://math.stackexchange.com/questions/2239040/show-that-fisher-information-matrix-is-the-second-order-gradient-of-kl-divergenc