Open mily20001 opened 3 years ago
Hello, this is probably a duplicate of https://github.com/hill-a/stable-baselines/issues/196. Which OS are you using?
I would recommend using PPO2 (or even Stable-Baselines3 PPO), as it also supports multiprocessing and usually gives results comparable to ACKTR.
Hi, thanks for the response @araffin. I'm comparing how RL algorithms perform on a problem simulated by my custom env, which is why I wanted to test ACKTR. I've seen #196, but I don't think memory is the issue in my case. My OS is openSUSE Linux 15.2.
Describe the bug
I wanted to check how ACKTR would perform on my custom env. It performs the first 10 updates quite fast, but after that each iteration takes very long on my env (with async_eigen_decomp=True it takes even longer), and it hangs on Atari. During the first 10 updates it uses all cores of my CPU and a significant part of my GPU; after that it uses only one core at 100% and nothing else. Interestingly, the same thing happens when I use an env created with `make_atari`, while an env created with `make_atari_env` seems to perform better (still slow and only one core after the 10th update, but it doesn't completely hang like the `make_atari` env).

Code example
For `make_atari_env`, example output from the callback looks like this:

For `make_atari`:

For my custom env (4 workers wrapped in `SubprocVecEnv`, observation shape is `Box(0, 255, (90, 120, 5), uint8)`):

System Info
Describe the characteristic of your environment:
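For anyone trying to reproduce the slowdown, here is a minimal sketch of a timing callback that records the wall-clock interval between consecutive callback calls and flags the ones that are much slower than the first few. It is not the callback used in this report. It assumes the function-style callback convention of Stable-Baselines 2, where the training loop calls `callback(locals_, globals_)` and continues as long as the return value is not `False`. The class name `UpdateTimer`, its `slow_calls` helper, and the `factor`/`warmup` thresholds are hypothetical and chosen only for illustration.

```python
import statistics
import time


class UpdateTimer:
    """Function-style callback that records time between consecutive calls.

    Assumption: the training loop invokes `callback(locals_, globals_)` and
    keeps going while the callback returns True. How often the callback is
    invoked depends on the algorithm and library version.
    """

    def __init__(self, clock=time.perf_counter):
        self._clock = clock      # injectable so the timer can be tested
        self._last = None        # timestamp of the previous call
        self.durations = []      # seconds between consecutive calls

    def __call__(self, locals_, globals_):
        now = self._clock()
        if self._last is not None:
            self.durations.append(now - self._last)
        self._last = now
        return True  # never request early stopping

    def slow_calls(self, factor=5.0, warmup=10):
        """Return indices of intervals slower than factor * warm-up median."""
        if len(self.durations) <= warmup:
            return []
        baseline = statistics.median(self.durations[:warmup])
        return [
            i
            for i, duration in enumerate(self.durations)
            if i >= warmup and duration > factor * baseline
        ]
```

Assuming the usual `learn(..., callback=...)` parameter, an instance could be passed as `callback=timer`, and `timer.slow_calls()` inspected afterwards to see where iterations start to take much longer than during the initial fast phase.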