Open PeterZhizhin opened 5 years ago
Hi @PeterZhizhin
Can you confirm that you are running the latest binary available here: https://github.com/Unity-Technologies/obstacle-tower-env#download-the-environment?
Hi @awjuliani
Yes, I run this binary. I also figured out that running in headless mode (realtime_mode=False) makes seeds work properly according to UnitySDK.log.
For me, it is always floor 0 and seed 0 after each reset. The first run is a completely random seed because you didn't reset it before running (there is no env.reset()
before while True: episode_reward = run_episode(env)
)
So I didn't encounter any bug here. My Unity log outputs this:
3/7/2019 7:40:39 PM
Log
Academy resetting
Log
Seed: 26
Log
Seed: 33
Log
Academy resetting
Log
Seed: 0
Log
Academy resetting
Log
Seed: 0
Which seems totally correct. Are you sure you're running v1.2?
Hello.
Try running this piece of code:
Run with the following command:
python run.py --realtime
Observe the agent's environment. For me, after each reset, the room's layout is always different (sometimes the door to the next level is on the right, sometimes it is straight ahead).In the output of the command, I see the following:
However, this is the output of UnitySDK.log:
I have verified that the same happens with
seed(1)
as well.