Hello,
Thank you very much for providing this code. A student and I have been following the training example for Stage1, but when one of the environments reaches the maximum number of episodes, the code appears to enter an infinite loop, and the other environments do not seem to continue their iterations. Is this supposed to happen? If not, what is the expected output once the number of episodes is completed?
I think the cause may be process synchronization. Some processes have finished while others are still running, so the running ones keep waiting at a synchronization point for processes that have already finished.
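To illustrate the kind of hang I mean, here is a minimal sketch. It is not the training code from this repository: it assumes each worker synchronizes with the others once per step, and it uses Python threads with `threading.Barrier` as a stand-in for whatever synchronization the real environments use. The names `run_workers` and `steps_per_worker` are made up for this example. A timeout is added only so the demo terminates; without it, the remaining workers would block forever.

```python
import threading


def run_workers(steps_per_worker, timeout=0.5):
    """Start one thread per entry in steps_per_worker (hypothetical helper).

    Each thread waits on a shared barrier once per step, so every step
    needs all workers to arrive. Returns one status string per worker.
    """
    n = len(steps_per_worker)
    barrier = threading.Barrier(n)
    status = [None] * n

    def worker(rank, steps):
        try:
            for _ in range(steps):
                # Blocks until all n workers reach this point.
                # The timeout exists only so this demo can finish.
                barrier.wait(timeout=timeout)
            status[rank] = "finished"
        except threading.BrokenBarrierError:
            # Not every worker arrived: in a real run without a timeout,
            # this worker would simply wait here forever.
            status[rank] = "stuck"

    threads = [
        threading.Thread(target=worker, args=(rank, steps))
        for rank, steps in enumerate(steps_per_worker)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return status


if __name__ == "__main__":
    # Every worker runs the same number of steps: all of them finish.
    print(run_workers([3, 3, 3]))
    # Worker 0 stops one step early: the others wait for it at the barrier.
    print(run_workers([2, 3, 3]))
```

If the training loop behaves like this, one way out would be to make every process run the same number of synchronized steps, or to have all processes agree on a common stopping condition before any of them exits.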
Thanks,
Julio Godoy