Closed mctigger closed 2 years ago
Hi Tim, those are both correct. It's 16 env steps or 4 agent steps. The action repeat introduces the factor of 4 between the two.
Thank you for answering so quickly!
In the paper in the hyperparameters section it says Environment steps per update: 4. So in the paper it should actually be Agent Steps per update: 4 or Environment steps per update: 16? Just want to make sure I understand you correctly.
Ah, yes. It's every 16 frames or 4 actions. I'll update the paper to make it clearer.
Hi danijar, how many environment steps are you running per update? In the paper it is 4 (so after every step the agent makes it is updated because of action repeat?), but here in the config it says
train_every: 16
. What is the correct number?Best, Tim