Closed thebes2 closed 2 years ago
Collect random episodes when initializing a new model and collect rollouts using the model when resuming training on some model.
Collect random episodes when initializing a new model and collect rollouts using the model when resuming training on some model.