smilesun / rlR

Deep Reinforcement Learning in R (Deep Q Learning, Policy Gradient, Actor-Critic Method, etc)
https://smilesun.github.io/rlR
Other
26 stars 4 forks source link

TD error clipping on top of gradient clipping #27

Closed smilesun closed 6 years ago

smilesun commented 6 years ago
  1. envronment agent interaction slide first
  2. equations to behind
  3. python independency
  4. custom environent more emphasis
  5. more documents