This book is awesome. I'm just leaving a note for anyone who wants to play with the Chapter 4 code and test the simple strategies progressively with the environment ("BanditSlipperyWalk-v0").
Within each loop, you need to reset the environment each time if you plan to inspect the values of Q and Qe.
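For what it's worth, here is a rough sketch of what I mean. It is not the book's code: TinyBanditWalk is a made-up stand-in for "BanditSlipperyWalk-v0" (I assumed one start state, two actions, an 80% chance of moving the intended way, and a reward of 1 for ending on the right), and pure_exploration is my own simplified take on a random-action strategy. The only point is where env.reset() sits.

```python
import random


class TinyBanditWalk:
    """Hypothetical stand-in environment, NOT the real BanditSlipperyWalk-v0.

    Assumed layout: states 0 (left, terminal), 1 (start), 2 (right, terminal).
    Every episode ends after a single step.
    """

    def __init__(self, seed=0):
        self.rng = random.Random(seed)
        self.done = True  # force a reset() before the first step()

    def reset(self):
        self.done = False
        return 1  # start state

    def step(self, action):
        if self.done:
            raise RuntimeError("episode is over, call reset() first")
        # Assumed slipperiness: the intended move happens 80% of the time.
        moved = action if self.rng.random() < 0.8 else 1 - action
        reward = 1.0 if moved == 1 else 0.0
        self.done = True
        next_state = 2 if moved == 1 else 0
        return next_state, reward, True, {}


def pure_exploration(env, n_episodes=200, seed=0):
    """Pick actions uniformly at random and track incremental mean estimates."""
    rng = random.Random(seed)
    Q = [0.0, 0.0]  # current estimate per action
    N = [0, 0]      # visit count per action
    Qe = []         # snapshot of Q after every episode
    for _ in range(n_episodes):
        env.reset()  # the reset this note is about: once per iteration
        action = rng.randrange(2)
        _, reward, _, _ = env.step(action)
        N[action] += 1
        Q[action] += (reward - Q[action]) / N[action]
        Qe.append(list(Q))
    return Q, Qe


env = TinyBanditWalk(seed=0)
Q, Qe = pure_exploration(env)
```

With the reset inside the loop, Q holds the final estimates and Qe holds one snapshot per episode, so both can be inspected after each run. In this toy version, removing the reset makes the second step() raise an error; the real environment may behave differently.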