You could try using more CPU and GPU actors, for example --cpu_actor 14 --gpu_actor 20. Since you have four RTX 6000 GPUs, each with more than 20 GB of memory, I think the original train.sh script should run on your machine as is.
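Before raising the actor counts, it may help to check them against the cores available on the machine. The following is only a minimal sketch: the helper names `actor_flags` and `cpu_oversubscribed` are hypothetical and not part of the repository, and the rule "do not use more CPU actors than cores" is an assumption, not something stated by the authors.

```python
import os


def actor_flags(cpu_actor: int, gpu_actor: int) -> str:
    """Format the two actor flags from the reply as a command-line fragment.

    Hypothetical helper: only the flag names come from the thread.
    """
    if cpu_actor < 1 or gpu_actor < 1:
        raise ValueError("actor counts must be positive")
    return f"--cpu_actor {cpu_actor} --gpu_actor {gpu_actor}"


def cpu_oversubscribed(cpu_actor: int, cores=None) -> bool:
    """Return True if more CPU actors are requested than cores exist.

    Assumption: one core per CPU actor is a sensible upper bound.
    """
    if cores is None:
        # os.cpu_count() can return None on some platforms.
        cores = os.cpu_count() or 1
    return cpu_actor > cores


if __name__ == "__main__":
    print(actor_flags(14, 20))
    if cpu_oversubscribed(14):
        print("Warning: more CPU actors requested than available cores.")
```

If the check warns, lowering --cpu_actor to the number of available cores is a reasonable first thing to try.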
First of all, congratulations on the great work!
I've been trying to train an agent to play Breakout, and training is very slow. This confuses me because, according to the paper, a full training run of 100k steps should take about 7 hours. My experience has been different:
Running time:
Hardware:
Running command:
Do you have any ideas or advice on how we could reduce the runtime?
@YeWR