Closed eugene-kharitonov closed 3 years ago
Should be handy when training populations of agents; this way we have multiple pairs of agents in a single batch. In turn, this would reduce the gradient variance.
Should be handy when training populations of agents; this way we have multiple pairs of agents in a single batch. In turn, this would reduce the gradient variance.