Closed jpiabrantes closed 5 years ago
Hey @jpiabrantes , you might want to look at the policy gradient example, it's a minimal PG implementation that still works with all the rllib features.
https://github.com/ray-project/ray/tree/master/python/ray/rllib/agents/pg
Thanks @ericl that helped!
I am trying to learn how to use RLlib default models/optimisers/environments but also how to create my own and add them to RLlib. I would like to see an example (could be the PPO with Cartpole) being built from scratch step by step - when I look into the code and see it whole at once I get a bit overwhelmed. Was this already done somewhere else?
Love what you have been building! Cheers.