Closed dongfangliu closed 3 years ago
I spent a whole day going through the ppo2 script, and I did not see any code that resets the environment when the step() function returns dones=True.
That's really strange.
This is handled by the vectorized environments. Any default environment is wrapped into a vectorized environment of size one, which automatically resets an episode as soon as it hits done=True, so the training script never needs to call reset() itself after the first one.
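To make that concrete, here is a rough sketch of the auto-reset idea. It is not the library's actual code: CountdownEnv and AutoResetVecEnv are hypothetical names made up for this illustration, and the wrapper is a simplified stand-in for what a vectorized environment of one does inside its step call.

```python
class CountdownEnv:
    """Toy environment (hypothetical): an episode lasts `length` steps."""

    def __init__(self, length=3):
        self.length = length
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t  # the observation is just the step counter

    def step(self, action):
        self.t += 1
        done = self.t >= self.length
        return self.t, 1.0, done, {}


class AutoResetVecEnv:
    """Simplified stand-in for a vectorized wrapper (assumed behavior)."""

    def __init__(self, env_fns):
        self.envs = [fn() for fn in env_fns]

    def reset(self):
        return [env.reset() for env in self.envs]

    def step(self, actions):
        obs_batch, rews, dones, infos = [], [], [], []
        for env, action in zip(self.envs, actions):
            obs, rew, done, info = env.step(action)
            if done:
                # The wrapper resets here, so the training loop never has to.
                # The returned observation is the first one of the new episode.
                obs = env.reset()
            obs_batch.append(obs)
            rews.append(rew)
            dones.append(done)
            infos.append(info)
        return obs_batch, rews, dones, infos


# A "vectorized environment of one", stepped past the end of an episode.
venv = AutoResetVecEnv([lambda: CountdownEnv(length=3)])
obs = venv.reset()
trace = []
for _ in range(4):
    obs, rews, dones, infos = venv.step([0])
    trace.append((obs[0], dones[0]))

print(trace)  # [(1, False), (2, False), (0, True), (1, False)]
```

Note that on the step where done is True, the observation handed back is already the reset one (0 here), which is why a rollout loop can keep calling step() without ever checking dones for reset purposes.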