This python file implements the Deep Deterministic Policy Gradient algorithm in a multiagent tag-game gym environment, slightly modified by Rohan Sawhney. I did it as a final assignment for Move37, a School of AI course.
Use the following command with the desired NUMBER_OF_EPISODES
to lauch the file and start training :
python episode.py --n_episodes NUMBER_OF_EPISODES
Further hyperparameter tuning can be done by passing arguments listed in episode.py
.
You need to install both gym
and PyTorch
to run the file.
pip install gym
Here we plot the length of episodes as training goes on :
This project is under the MIT License - see the LICENSE file for details.
Inspired from Rohan Sawhney's implementation.