ShangtongZhang / reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction
MIT License

Misunderstanding in chapter 2 #117

Closed. zZthebreakerZz closed this issue 5 years ago

zZthebreakerZz commented 5 years ago

Hi everyone. I have read the book, but I still don't understand why we need to add "true_reward" to "self.q_true". The book only says that q_true is drawn from a normal distribution. Could someone explain this in detail, please? Thank you!

ShangtongZhang commented 5 years ago

I remember q_true is the mean of the Gaussian.
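
To make the point about the mean concrete, here is a minimal sketch of how such a testbed could be set up. It is not copied from the repository: the function names `make_q_true` and `sample_reward` are hypothetical, and the assumption is that `true_reward` acts as a constant offset that shifts the mean of the distribution from which the true action values are drawn. With `true_reward=0` this matches the standard ten-armed testbed in the book; a nonzero offset matches the variant the book uses for the gradient bandit baseline experiment, where the true expected rewards are drawn with a mean of +4 instead of zero.

```python
import numpy as np


def make_q_true(k=10, true_reward=0.0, rng=None):
    """Draw the true action values q*(a) for a k-armed bandit.

    Assumption: true_reward is a constant offset, so each q*(a) is
    sampled from a normal distribution N(true_reward, 1).
    """
    rng = np.random.default_rng() if rng is None else rng
    return rng.standard_normal(k) + true_reward


def sample_reward(q_true, action, rng=None):
    """Sample a reward for one action from N(q*(action), 1).

    Here q_true[action] is the mean of the Gaussian that generates
    the reward, which is the point made in the reply above.
    """
    rng = np.random.default_rng() if rng is None else rng
    return rng.standard_normal() + q_true[action]


# Use many arms only to make the shift in the mean easy to see.
rng = np.random.default_rng(0)
q_zero = make_q_true(k=100_000, true_reward=0.0, rng=rng)
q_four = make_q_true(k=100_000, true_reward=4.0, rng=rng)

# The sample means should be close to 0 and 4 respectively.
print(q_zero.mean(), q_four.mean())
```

Under these assumptions, adding `true_reward` does not change the shape of the distribution, only where it is centered. That matters for methods that compare rewards against a baseline, since a shifted mean changes how a missing baseline affects learning.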