Emphasize on modular agent learning via divide and conquer

hpi-sam / Robust-Multi-Agent-Reinforcement-Learning-for-SAS

Research project on robust multi-agent reinforcement learning (marl) for self-adaptive systems (sas)

MIT License

0 stars 0 forks source link

Shops by agent is partition of the state-space, therefore, there might be better and worse partitions with respect to their ability to make learning within partition more sample efficient and quicker convergence to optimal policy. Learning optimal partitions might be a unsupervised learning problem that might also be influence by the types of perturbations that different partitions might suffer. Partitioning (or clustering) might be guided by types of usage (operational profile), types of failures (the corresponding propagations), and types of perturbations (changes in both usage and failure propagation).

hpi-sam / Robust-Multi-Agent-Reinforcement-Learning-for-SAS

Emphasize on modular agent learning via divide and conquer #19