Closed (biggzlar closed 5 years ago)
Previously, run_episode() would simply return the most recent reward. Returning the cumulative episode reward makes more sense imo.
Hi @biggzlar
Thanks for this change. I completely agree that returning the cumulative episode reward makes much more sense.
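For anyone reading this later, here is a minimal sketch of the behaviour being discussed. It is not the repository's actual code: `CountingEnv`, its `reset()`/`step()` signatures, and the `policy` callable are hypothetical stand-ins, used only to show `run_episode()` summing rewards over the episode instead of returning the last one.

```python
class CountingEnv:
    """Hypothetical toy environment: the reward at step t is t (1, 2, 3, ...)."""

    def __init__(self, horizon=3):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t  # initial state

    def step(self, action):
        self.t += 1
        reward = float(self.t)
        done = self.t >= self.horizon
        return self.t, reward, done


def run_episode(env, policy):
    """Run one episode and return the cumulative reward, not the last step's reward."""
    state = env.reset()
    total_reward = 0.0
    done = False
    while not done:
        action = policy(state)
        state, reward, done = env.step(action)
        total_reward += reward  # accumulate instead of overwriting
    return total_reward


episode_return = run_episode(CountingEnv(horizon=3), policy=lambda state: 0)
```

With this toy environment the rewards are 1, 2 and 3, so the episode return is 6.0, whereas returning only the most recent reward would give 3.0.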