vlad17 / mve

MVE: model-based value estimation
Apache License 2.0
10 stars 0 forks source link

Debug non-determinism #294

Open vlad17 opened 6 years ago

vlad17 commented 6 years ago

The same run on the same seed, even on tensorflow CPU, is giving different learning curves. There might be some non-determinism somewhere, maybe in the environment-seeding.

alvinwan commented 6 years ago

Which test exposes this issue? At a cursory glance it looks like the env and the agents are properly seeded, but I can dig deeper.

vlad17 commented 6 years ago

No test in particular, this is just what happens when you run the same setting (Eg. main_ddpg.py) twice. don't worry about this for now.