Expected behavior
dqn with 2 or 3 steps is worse than 1 step dqn for atari breakout and lunar, I'm not sure if it's a bug or if it's supposed to be worse. in any case it would be nice to have the dqn n_steps implemented in mushroom_rl
System information (please complete the following information):
Describe the bug I modified DQN to enable n_steps DQN, but I get worse results, am I missing something?
To Reproduce use dqn with this function and defining self.n_steps in the init:
Expected behavior dqn with 2 or 3 steps is worse than 1 step dqn for atari breakout and lunar, I'm not sure if it's a bug or if it's supposed to be worse. in any case it would be nice to have the dqn n_steps implemented in mushroom_rl
System information (please complete the following information):
thanks