Hello! Thank you for this great work. Two questions:
Max steps
I'm having trouble finding the maximum number of steps each environment is rolled out for per episode (during training and eval). Could you direct me to the place I can see that?
Meaning of Environment Steps
Also, I'm curious what "environment steps" means in graphs like this one. Does it mean the total number of times env.step() is called during training?
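To make the second question concrete, here is a minimal sketch of the counting I have in mind. `ToyEnv`, `max_episode_steps`, and `count_env_steps` are hypothetical names made up for illustration; none of them come from this repository, and the real per-episode limit is exactly what I'm asking about.

```python
class ToyEnv:
    """Hypothetical stand-in environment (not from the repository)."""

    def __init__(self, max_episode_steps=5):
        # Assumed per-episode step limit; the real value is what I'm asking about.
        self.max_episode_steps = max_episode_steps
        self.t = 0

    def reset(self):
        self.t = 0
        return 0  # dummy observation

    def step(self, action):
        self.t += 1
        done = self.t >= self.max_episode_steps
        return self.t, 0.0, done  # observation, reward, done


def count_env_steps(env, num_episodes):
    """Count every call to env.step() across all episodes."""
    total_steps = 0
    for _ in range(num_episodes):
        env.reset()
        done = False
        while not done:
            _, _, done = env.step(action=0)
            total_steps += 1  # one "environment step" under this reading
    return total_steps


total = count_env_steps(ToyEnv(max_episode_steps=5), num_episodes=3)
print(total)  # 15
```

Under this reading, the x-axis would be `total_steps` summed over all training episodes. If something like action repeat or several parallel environments is involved, the number could differ from the raw call count, which is why I'd like to confirm.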