Implementation of the Deep Q-Learning approach for multi-agent systems. The goal is to ensure that each agent has a distinct policy compared to every other agent. Our objective is to encourage diverse policies among these agents, even if they are not optimal. The emphasis lies in fostering cooperation among agents while acknowledging and embracing the diversity in their individual decision-making strategies.
Our environment is a square grid of size n x n that can contain p agents. Each agent has five possible actions:
files: Class_environnement.py
8 agent train on a 16x16 map with a deep Q learning model
file Deep_Q_Learning_diversifié.ipynb
Q_learning_diverse contain a simple Q learning solution of the environment with two agents
assets contain the assets for the environment
Test-gif contain evaluation of some models after trainning