MansMeg / IntroML

Introductory course in Machine Learning for master students in Statistics at Uppsala University
17 stars 11 forks source link

Assignment 8 typos and clarifications #20

Closed andreasostling closed 10 months ago

MansMeg commented 10 months ago

The return is the aggregated reward. So we should not change return to reward, but maybe change that the function should output xyz instead of returning it?

andreasostling commented 10 months ago

I'm not sure I'm following completely, is this what you meant?

MansMeg commented 10 months ago

The concept return mean two things here. First, the return in RL is the aggregation of the reward signals (i.e. we want to maximize the return). Second, we use "return" for the output from the function. Does this make more sense?

andreasostling commented 10 months ago

Ahhh, I see. Okay yeah that makes sense. Let's just leave it as it was then.