Open kan-s0 opened 2 years ago
Describe the bug RuntimeError when running mpo
To Reproduce
python main.py --config.mpo.atari --env.name breakout --sync
When config is modified with the values shown in the paper, it occurs faster and more frequently.
Expected behavior
inf
nan
Screenshots
Development Env. (OS, version, libraries):
Additional context
Describe the bug RuntimeError when running mpo
To Reproduce
When config is modified with the values shown in the paper, it occurs faster and more frequently.
Expected behavior
inf
,nan
or element < 0Screenshots
training graph
error txt
mpo generated agent code
Development Env. (OS, version, libraries):
Additional context