aio152 / Deep-Reinforcement-Learning-Multi-agents

0 stars 0 forks source link

readme

MARL

Implementation of the Deep Q-Learning approach for multi-agent systems. The goal is to ensure that each agent has a distinct policy compared to every other agent. Our objective is to encourage diverse policies among these agents, even if they are not optimal. The emphasis lies in fostering cooperation among agents while acknowledging and embracing the diversity in their individual decision-making strategies.

Algorithm

algo_diverse_Qlearning

Environment

Our environment is a square grid of size n x n that can contain p agents. Each agent has five possible actions:

direction_agent

files: Class_environnement.py

Environment

Evaluation

8 agent train on a 16x16 map with a deep Q learning model

16x16 map with 8 agents(1)

file Deep_Q_Learning_diversifié.ipynb

Files

Q_learning_diverse contain a simple Q learning solution of the environment with two agents

assets contain the assets for the environment

Test-gif contain evaluation of some models after trainning