Closed fedetask closed 2 years ago
@ervteng do we store the active team when Ctrl-C is called? Should the user expect to resume training from team-0 or from the last active team?
@fedetask if the training runs long enough after resuming there should be plenty of swaps between the teams. Would it really matter if the training does not resume from the team where it stopped.
Closing as stale
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
Describe the bug When resuming a training that employes self-play, the training does not resume from the correct team.
To Reproduce
mlagents-learn
command using the--resume
flag The training now starts from team 0 again, instead of resuming from team 1 as it should.Environment (please complete the following information):
Note I haven't checked with one of the examples, and I'll do it as soon as possible. However, I feel this is not dependent on the environment, but rather on the training code.