dillonalaird closed 7 years ago
Step should return the total reward received over n_action_repeat instead of the last reward received.
I think it would be better to make this configurable depending on the environment.
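For illustration, here is a minimal sketch of what a configurable version might look like. It is not the project's actual code: the `ActionRepeat` wrapper, the `accumulate_reward` flag, and the `CountingEnv` stand-in are hypothetical names, and the sketch assumes a Gym-style `step` that returns `(obs, reward, done, info)`.

```python
class CountingEnv:
    """Hypothetical stand-in environment: reward 1.0 per step, done after `horizon` steps."""

    def __init__(self, horizon=10):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t

    def step(self, action):
        self.t += 1
        done = self.t >= self.horizon
        return self.t, 1.0, done, {}


class ActionRepeat:
    """Repeats each action n_action_repeat times (assumed wrapper, not the repo's class)."""

    def __init__(self, env, n_action_repeat=4, accumulate_reward=True):
        if n_action_repeat < 1:
            raise ValueError("n_action_repeat must be >= 1")
        self.env = env
        self.n_action_repeat = n_action_repeat
        # True: return the summed reward; False: return only the last reward.
        self.accumulate_reward = accumulate_reward

    def reset(self):
        return self.env.reset()

    def step(self, action):
        total_reward = 0.0
        for _ in range(self.n_action_repeat):
            obs, reward, done, info = self.env.step(action)
            total_reward += reward
            if done:
                # Stop repeating once the episode ends.
                break
        returned = total_reward if self.accumulate_reward else reward
        return obs, returned, done, info
```

With the flag on, a single `step` returns the sum of the rewards collected over the repeated frames; with it off, it keeps the behaviour described in the issue and returns only the last one, so each environment can pick whichever suits it.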