moabitcoin / cherry-pytorch

Reinforcement Learning Tutorials & other bedtime stories in PyTorch
MIT License
11 stars 1 forks source link

Policy gradient doom prep #30

Closed sandhawalia closed 4 years ago

sandhawalia commented 4 years ago

This PR brings in learning from policy_gradients in Cartpole-v0 into Doom