udacity / deep-reinforcement-learning

Repo for the Deep Reinforcement Learning Nanodegree program
https://www.udacity.com/course/deep-reinforcement-learning-nanodegree--nd893
MIT License
4.9k stars, 2.34k forks

Show differences from optimal #26

Closed: tkharris closed this 2 years ago

tkharris commented 5 years ago

Show where the MC blackjack policy plot differs from the optimal policy. This adds small red X's to the graph wherever your blackjack policy deviates from the optimal policy.
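For readers who want the idea without opening the diff, here is a minimal sketch of how such markers could be produced. It is not the code from this PR. The names `policy_deviations` and `mark_deviations` are hypothetical, and it assumes that both policies are dicts keyed by the Gym blackjack state `(player_sum, dealer_card, usable_ace)`, that the reference (optimal) policy is supplied by the caller, and that the plot uses the dealer's card on the x-axis and the player's sum on the y-axis in data coordinates.

```python
def policy_deviations(policy, optimal_policy):
    """Return the sorted states where `policy` picks a different action
    than `optimal_policy`. States missing from `policy` are skipped.

    Both arguments are assumed to be dicts mapping
    (player_sum, dealer_card, usable_ace) -> action (0 = stick, 1 = hit).
    """
    return sorted(
        state
        for state, best_action in optimal_policy.items()
        if state in policy and policy[state] != best_action
    )


def mark_deviations(ax, deviations, usable_ace):
    """Draw a red X on `ax` for each deviating state in one panel
    (usable ace or no usable ace). Returns the number of markers drawn.

    `ax` is expected to behave like a matplotlib Axes. The axis layout
    (x = dealer card, y = player sum) is an assumption about the plot.
    """
    points = [(dealer, player)
              for (player, dealer, ace) in deviations
              if ace == usable_ace]
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    if points:
        ax.scatter(xs, ys, marker="x", color="red", zorder=3)
    return len(points)
```

In use, you would call `policy_deviations` once, then call `mark_deviations` on each subplot's axes after the policy image has been drawn, so the X's sit on top of it.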

tkharris commented 5 years ago

Minor improvement but I found it useful. Here's what it looks like:

[Image: mc_policy]