TheVeggydude / MARL-lockdown

MIT License

MARL-lockdown

A Multi-Agent Reinforcement Learning project that learns to solve the Coordinated Lockdown problem.

Assumptions

Some assumptions have been made when writing this code. They are explained here.

Goals

The end goal is a tool for researching Reinforcement Learning techniques in a multi-agent environment, specifically when they are used to learn containment policies.

In practice, this means creating a mapping from the state (the SEIR compartments) to an abstract value we call the 'containment level'. This value, as mentioned in the assumptions, is an abstract measure of the steps an agent takes to reduce its population's exposure to infected people.
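To make the state-to-containment mapping concrete, here is a minimal sketch in Python. It is not taken from this repository: the names `SEIRState`, `threshold_policy` and `seir_step`, the rate parameters, the restriction of the containment level to [0, 1], and the idea that containment linearly scales down the transmission rate are all assumptions made for illustration. A learned policy would take the place of the hand-written `threshold_policy`.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SEIRState:
    """Population fractions per SEIR compartment (assumed to sum to 1)."""
    susceptible: float
    exposed: float
    infectious: float
    recovered: float


def threshold_policy(state: SEIRState, threshold: float = 0.05) -> float:
    """Hypothetical hand-written policy mapping a state to a containment level.

    The level grows linearly with the infectious fraction and saturates at 1
    once that fraction reaches `threshold`. The result is clamped to [0, 1].
    """
    return min(1.0, max(0.0, state.infectious / threshold))


def seir_step(
    state: SEIRState,
    containment: float,
    beta: float = 0.3,   # transmission rate (illustrative value)
    sigma: float = 0.2,  # rate of exposed becoming infectious (illustrative)
    gamma: float = 0.1,  # recovery rate (illustrative)
) -> SEIRState:
    """One discrete-time SEIR update.

    Assumption: a containment level c reduces transmission to beta * (1 - c).
    """
    effective_beta = beta * (1.0 - containment)
    new_exposed = effective_beta * state.susceptible * state.infectious
    new_infectious = sigma * state.exposed
    new_recovered = gamma * state.infectious
    return SEIRState(
        susceptible=state.susceptible - new_exposed,
        exposed=state.exposed + new_exposed - new_infectious,
        infectious=state.infectious + new_infectious - new_recovered,
        recovered=state.recovered + new_recovered,
    )


if __name__ == "__main__":
    state = SEIRState(susceptible=0.97, exposed=0.01, infectious=0.02, recovered=0.0)
    for day in range(5):
        level = threshold_policy(state)
        state = seir_step(state, level)
        print(day, round(level, 3), state)
```

In a multi-agent setting, each agent would presumably apply its own policy to its own region's compartments; how regions interact is outside the scope of this sketch.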

Cost function

Acknowledgements

The model is based on a lot of previous work, such as this overview and this explanation.