Closed RylanSchaeffer closed 2 years ago
I just found https://github.com/MushroomRL/mushroom-rl/blob/dev/examples/lqr_pg.py
If these examples could be more clearly linked in the documentation, that would be fantastic!
This is a good suggestion! I will put also this in the ToDo list the implementation of policy gradient methods is done only in the context of function approximation/continuous action spaces. Supporting finite actions in these approaches would add major overhead and complications to the code, for no practical benefit (there's not much benefit in using a policy gradient in a setting where you can use a value-based approach directly and easily)
I'm trying to implement a simple REINFORCE agent on
Gridworld
. However, I keep hitting the following error:It appears that the policy needs to be initialized with an approximator. I would really appreciate a simple tutorial showing how to create an approximator and a policy on a simple environment.
Thanks in advance!