Unity-Technologies / ml-agents

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
https://unity.com/products/machine-learning-agents
Other
17.12k stars 4.15k forks source link

Self-play does not resume correctly using --resume #4029

Closed fedetask closed 2 years ago

fedetask commented 4 years ago

Describe the bug When resuming a training that employes self-play, the training does not resume from the correct team.

To Reproduce

Environment (please complete the following information):

Note I haven't checked with one of the examples, and I'll do it as soon as possible. However, I feel this is not dependent on the environment, but rather on the training code.

anupam-142857 commented 4 years ago

@ervteng do we store the active team when Ctrl-C is called? Should the user expect to resume training from team-0 or from the last active team?

anupam-142857 commented 4 years ago

@fedetask if the training runs long enough after resuming there should be plenty of swaps between the teams. Would it really matter if the training does not resume from the team where it stopped.

hvpeteet commented 2 years ago

Closing as stale

github-actions[bot] commented 2 years ago

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.