sentenai / reinforce

Reinforcement learning in haskell
https://sentenai.github.io/reinforce/
BSD 3-Clause "New" or "Revised" License
44 stars 17 forks source link

Add some policy gradient methods to the algorithms #13

Open stites opened 6 years ago

stites commented 6 years ago