question about _on_step method in custom callback using also a custom wrapper
I am new to stable_baselines and I was wondering if it is normal that the on_step method of a callback start after reset of the env.
Isn't it suppose to start after the step method of the env (or the wrapper in my case)?
I know how I may avoid the issue, but I was wondering if there is a way to call the callback function exactly after the step method of the env.
The env is the custom env from the tutorial (snake env).
The following code should show that the callback print always episode_length and episode_return equal to zero because the env is resetted.
question about _on_step method in custom callback using also a custom wrapper
I am new to stable_baselines and I was wondering if it is normal that the on_step method of a callback start after reset of the env. Isn't it suppose to start after the step method of the env (or the wrapper in my case)? I know how I may avoid the issue, but I was wondering if there is a way to call the callback function exactly after the step method of the env. The env is the custom env from the tutorial (snake env). The following code should show that the callback print always episode_length and episode_return equal to zero because the env is resetted.