vojtamolda / reinforcement-learning-an-introduction

Solutions to exercises in Reinforcement Learning: An Introduction (2nd Edition).
340 stars 74 forks source link

Exercise 3.21 #24

Open rjscherrer opened 1 year ago

rjscherrer commented 1 year ago

I think the solution to exercise 3.21 is wrong as $\upsilon{\text{putt}} = q*(s, \text{putter})$. Thus, the solution to exercise 3.21 is the same image as the upper part of figure 3.3 on page 61.