moabitcoin / cherry-pytorch

Reinforcement Learning Tutorials & other bedtime stories in PyTorch
MIT License
11 stars 1 forks source link

PPO/TRPO #49

Open sandhawalia opened 4 years ago

sandhawalia commented 4 years ago

We want to support Trusted Region Policy Optimisation and Proximal Policy Optimisation