Closed Moirai7 closed 1 year ago
import envpool, numpy, sys print(envpool.__version__, numpy.__version__, sys.version, sys.platform)
0.8.2 1.21.6 3.7.10 | packaged by conda-forge | (default, Feb 19 2021, 16:07:37) [GCC 9.3.0] linux
I tried the bellowing code and found out the initial state of breakout enviroment is the same with different seed. I wonder why?
import envpool import numpy as np for s in [0,1,2,3,4]: env = envpool.make_gymnasium( 'Breakout-v5', num_envs=1, seed=s*1000, episodic_life=False, reward_clip=False, stack_num=4, ) observation, info = env.reset() print(observation) print(s, np.sum(observation))
It's the same behavior if you try gym or atari-py
0.8.2 1.21.6 3.7.10 | packaged by conda-forge | (default, Feb 19 2021, 16:07:37) [GCC 9.3.0] linux
I tried the bellowing code and found out the initial state of breakout enviroment is the same with different seed. I wonder why?