xysun / blog

Mainly my paper reading notes
5 stars 0 forks source link

Grid world example for policy evaluation, policy iteration, and value iteration #3

Open xysun opened 5 years ago

xysun commented 5 years ago

I have been following David Silver's reinforcement learning lecture series (slides, videos)

In lecture 3 he talked about 3 things:

He used a simple grid world example to walk through all 3 concepts, and because I always feel super secure, I decided to actually code it and see if the values match.

They do. Duh.

My jupyter notebook can be found here.