LxMLS / lxmls-guide

Lisbon Machine Learning Summer School Lab Guide
81 stars 61 forks source link

Updates for RL #123

Closed q0o0p closed 5 years ago

q0o0p commented 5 years ago

Fix exercise 6.1 description, it's confusing

gamma is discount rate, not learning rate these are different things learning rate is alpha, there is no learning rate in Policy Evaluation

MStaniek commented 5 years ago

looks good to me