Self-play does not resume correctly using --resume

fedetask commented 4 years ago

Describe the bug

When resuming a training run that employs self-play, training does not resume from the correct team.

To Reproduce

Start a self-play training run that has team 0 and team 1.
Wait until team 0 has finished its first training phase and the learning team is swapped to team 1.
Interrupt training with CTRL-C.
Run the same mlagents-learn command again with the --resume flag.

The training now starts from team 0 again, instead of resuming from team 1 as it should.

Environment (please complete the following information):

Unity Version: Unity 2019.3.13f1
OS + version: Ubuntu 18.04
ML-Agents version: 0.16.0
TensorFlow version: 2.1.0
Environment: Custom

Note: I haven't checked this with one of the example environments yet, and I'll do so as soon as possible. However, I suspect this does not depend on the environment but rather on the training code.

anupam-142857 commented 4 years ago

@ervteng do we store the active team when Ctrl-C is pressed? Should the user expect training to resume from team 0 or from the last active team?
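
For illustration only, here is a minimal sketch of what "storing the active team" could look like. It is not the ML-Agents implementation: the `TeamSwapState` class, the `self_play_state.json` file name, and the `demo` helper are all hypothetical, and only the `team_change` name is borrowed from the self-play settings. The idea is to write the learning team and its progress to disk on interrupt, then read it back on resume instead of starting from team 0.

```python
import json
import os
import tempfile
from dataclasses import asdict, dataclass


@dataclass
class TeamSwapState:
    """Hypothetical record of which team is learning and how far into its turn it is."""

    learning_team: int = 0
    steps_since_swap: int = 0

    def step(self, team_change: int, num_teams: int = 2) -> None:
        """Advance one trainer step; swap the learning team every `team_change` steps."""
        self.steps_since_swap += 1
        if self.steps_since_swap >= team_change:
            self.learning_team = (self.learning_team + 1) % num_teams
            self.steps_since_swap = 0

    def save(self, path: str) -> None:
        """Persist the state, e.g. from the handler that runs on Ctrl-C."""
        with open(path, "w") as f:
            json.dump(asdict(self), f)

    @classmethod
    def load(cls, path: str) -> "TeamSwapState":
        """Restore the saved state, or fall back to team 0 if nothing was saved."""
        if not os.path.exists(path):
            return cls()
        with open(path) as f:
            return cls(**json.load(f))


def demo() -> TeamSwapState:
    """Simulate an interrupted run followed by a resume."""
    path = os.path.join(tempfile.mkdtemp(), "self_play_state.json")  # hypothetical file
    state = TeamSwapState()
    for _ in range(150):  # team 0 trains for 100 steps, then team 1 for 50
        state.step(team_change=100)
    state.save(path)  # training is interrupted here
    return TeamSwapState.load(path)  # what a resumed run would start from


if __name__ == "__main__":
    resumed = demo()
    print(resumed.learning_team, resumed.steps_since_swap)
```

With something along these lines, a resumed run would pick up with team 1 partway through its turn rather than restarting team 0's turn from scratch.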

anupam-142857 commented 4 years ago

@fedetask if the training runs long enough after resuming, there should be plenty of swaps between the teams. Would it really matter if the training does not resume from the team where it stopped?

hvpeteet commented 2 years ago

Closing as stale

github-actions[bot] commented 2 years ago

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

Unity-Technologies / ml-agents

Self-play does not resume correctly using --resume #4029