ray-project / tutorial


[Request] More low-level examples on RLlib #98

Closed: jpiabrantes closed this issue 5 years ago

jpiabrantes commented 5 years ago

I am trying to learn how to use RLlib's default models, optimisers, and environments, and also how to create my own and add them to RLlib. I would like to see an example (PPO with CartPole, say) built from scratch, step by step. When I look at the code and see it all at once, I get a bit overwhelmed. Has this already been done somewhere else?

Love what you have been building! Cheers.

ericl commented 5 years ago

Hey @jpiabrantes, you might want to look at the policy gradient example. It's a minimal PG implementation that still works with all the RLlib features.

https://github.com/ray-project/ray/tree/master/python/ray/rllib/agents/pg
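
For a feel of what a policy gradient update does before reading the linked code, here is a rough standalone sketch. It is not RLlib code and does not use any RLlib API: it is a plain-Python REINFORCE update on a toy two-armed bandit, and the names `softmax`, `train_bandit`, `arm_means`, and the hyperparameter values are all made up for illustration.

```python
import math
import random


def softmax(logits):
    """Turn a list of logits into action probabilities."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def train_bandit(arm_means=(0.2, 0.8), steps=2000, lr=0.1, seed=0):
    """REINFORCE on a hypothetical Bernoulli bandit.

    arm_means[i] is the (assumed) probability that arm i pays reward 1.
    Returns the learned action probabilities.
    """
    rng = random.Random(seed)
    logits = [0.0] * len(arm_means)  # policy parameters, one per arm
    baseline = 0.0                   # running average reward, reduces variance

    for _ in range(steps):
        probs = softmax(logits)
        # Sample an action from the current policy.
        action = rng.choices(range(len(probs)), weights=probs)[0]
        # Sample a 0/1 reward from the chosen arm.
        reward = 1.0 if rng.random() < arm_means[action] else 0.0
        advantage = reward - baseline

        # For a softmax policy, d log pi(action) / d logit_k
        # equals 1[k == action] - probs[k].
        for k in range(len(logits)):
            grad_log_prob = (1.0 if k == action else 0.0) - probs[k]
            logits[k] += lr * advantage * grad_log_prob

        # Move the baseline toward the observed reward.
        baseline += 0.05 * (reward - baseline)

    return softmax(logits)


if __name__ == "__main__":
    print(train_bandit())
```

The policy should end up strongly preferring the better-paying arm. A real environment such as CartPole replaces the single-step reward with a return summed over an episode, and the logits table with a neural network, but the update has the same shape.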

jpiabrantes commented 5 years ago

Thanks @ericl that helped!