hpi-sam / Robust-Multi-Agent-Reinforcement-Learning-for-SAS

Research project on robust multi-agent reinforcement learning (marl) for self-adaptive systems (sas)
MIT License
0 stars 0 forks source link

Future Work - Perfect Failure Masking Phenomenon #64

Open christianadriano opened 2 years ago

christianadriano commented 2 years ago

Current assumption is that if there an observation that indicates a failure on a shop, the shop might or not have a failure. However, if there are no observations indication failures on a given shop, then, we assume currently, that there no failures on that shop. In other words, we assume that absence of evidence is evidence of absence!

In the future we might want to relax this assumption (usually deemed to be too strong in real world). For that we would need to investigate the latent failures, for which we do not have any observation (no indication) of failure on a given shop. This latent failures might happen because of phenomenon of perfect failure masking, where one failure ocludes the other

jocodeone commented 2 years ago

For enabling the agent to handle a perfect failure masking, the action space has to include "NOOP" as an option that implies that the agent won't do anything.

For detecting a perfect failure masking I see several options: