Closed by joakimpersson 10 years ago
Unbound variable in equation 2.1: t. Add the non-discounted case!
Perhaps expand the sum a little. Don't use capital R; it is already defined as the reward function. Maybe use this r? http://webdocs.cs.ualberta.ca/~sutton/book/ebook/node33.html
Rename return to utility. YES :D
Explain the discount factor better before 2.3.1, and define utility (also called return).
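A possible shape for the fix, as a sketch only: the current text of equation 2.1 is not quoted in this issue, so the symbols below are assumptions. It uses U_t for the utility (return) from time step t, lowercase r for individual rewards as in the linked Sutton and Barto page, gamma for the discount factor, and T for the final time step of an episode. Writing out the first few terms binds t explicitly and shows what the discount factor does to later rewards.

```latex
% Discounted case (assumed notation: U_t = utility/return, r = reward, \gamma = discount factor)
U_t = r_{t+1} + \gamma r_{t+2} + \gamma^2 r_{t+3} + \dots
    = \sum_{k=0}^{\infty} \gamma^k \, r_{t+k+1},
\qquad 0 \le \gamma < 1

% Non-discounted (episodic) case, with T the final time step
U_t = r_{t+1} + r_{t+2} + \dots + r_T
    = \sum_{k=0}^{T-t-1} r_{t+k+1}
```

With gamma close to 0 the utility is dominated by the immediate reward; with gamma close to 1 later rewards count almost as much as early ones. The non-discounted sum corresponds to gamma = 1 and stays finite only because the episode ends at T.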