TheVeggydude / MARL-lockdown

A Multi-Agent Reinforcement Learning project that learns to solve the Coordinated Lockdown problem.
MIT License

Verify math #1

Closed TheVeggydude closed 4 years ago

TheVeggydude commented 4 years ago

The math for updating a single node could potentially be done by matrix/vector computations. Verify this.

TheVeggydude commented 4 years ago

This cannot be done through a single transformation matrix. The values for delta_S and delta_E depend on the product of two elements of the original vector, so the update is not a linear function of that vector.
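A quick way to check this, as a minimal sketch: I'm assuming a standard SEIR-style update here, where new exposures are proportional to S * I. The compartment order (S, E, I, R), the function name `seir_deltas`, and the rates `beta`, `sigma`, `gamma` are illustrative assumptions, not taken from the project code. Any fixed matrix M would satisfy M(a + b) = Ma + Mb, and the product term breaks that:

```python
def seir_deltas(state, beta=0.5, sigma=0.2, gamma=0.1):
    """Per-step deltas for an assumed SEIR-style model (hypothetical rates)."""
    s, e, i, r = state
    new_exposed = beta * s * i          # product of two state elements
    delta_s = -new_exposed
    delta_e = new_exposed - sigma * e
    delta_i = sigma * e - gamma * i
    delta_r = gamma * i
    return (delta_s, delta_e, delta_i, delta_r)


def add(u, v):
    """Element-wise sum of two state tuples."""
    return tuple(a + b for a, b in zip(u, v))


# Two states: only susceptibles, and only infectious.
a = (1.0, 0.0, 0.0, 0.0)
b = (0.0, 0.0, 1.0, 0.0)

lhs = seir_deltas(add(a, b))               # update applied to the combined state
rhs = add(seir_deltas(a), seir_deltas(b))  # what a linear map would have to give

is_linear = lhs == rhs
print(lhs, rhs, is_linear)
```

The two results disagree in delta_S and delta_E, which are exactly the terms that contain the product, so no constant matrix can reproduce the update.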

Moreover, the matrix would be sparse: most of its cells would be zero, so most of the storage and most of the multiplications would be wasted. It is therefore unlikely to be the most efficient solution.

The update loop is best written in a naive manner, as the matrix methods do not add any speedup, readability, or ease of use.
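For reference, a naive per-node loop could look roughly like the sketch below. It assumes each node is a dict of SEIR compartment fractions; the names `step_node` and `step_all`, the dict layout, and the rate parameters are hypothetical and not the project's actual code.

```python
def step_node(node, beta=0.5, sigma=0.2, gamma=0.1):
    """Advance one node by one step, in place (assumed SEIR-style rules)."""
    new_exposed = beta * node["S"] * node["I"]
    new_infectious = sigma * node["E"]
    new_recovered = gamma * node["I"]

    node["S"] -= new_exposed
    node["E"] += new_exposed - new_infectious
    node["I"] += new_infectious - new_recovered
    node["R"] += new_recovered


def step_all(nodes, **params):
    """Plain loop over all nodes; no matrices involved."""
    for node in nodes:
        step_node(node, **params)


nodes = [
    {"S": 0.9, "E": 0.0, "I": 0.1, "R": 0.0},
    {"S": 1.0, "E": 0.0, "I": 0.0, "R": 0.0},
]
step_all(nodes)
print(nodes)
```

Each flow is computed once and moved between compartments, so the total per node is conserved and the update reads the same way as the equations.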