Closed raymondchua closed 2 years ago
Not out of the box since DQN and variants expect a single stream of experience. One could get something working reasonably quickly by adapting TransitionAccumulator
so it accepts a batch of time steps.
Awesome, thanks @jqdm !
Hi, Does this work with vectorized environments (such as using gym.vector.AsyncVectorEnv function)?
Best, Raymond