openai / spinningup

An educational resource to help anyone learn deep reinforcement learning.
https://spinningup.openai.com/
MIT License
10.18k stars 2.23k forks source link

Symmetry of terms: On-Policy Value Function VS On-Policy Action-Value Function? #353

Open glisca opened 2 years ago

glisca commented 2 years ago

Why do you use the term On-Policy Value Function instead of On-Policy State-Value Function? At least for the sake of symmetry with On-Policy Action-Value Function, even that this symmetry is missing from Rich Sutton's book Reinforcement Learning: An Introduction.