Open VihasMakwana opened 2 months ago
Pinging @elastic/elastic-agent (Team:Elastic-Agent)
Please share your thoughts!
I think the original idea on the split health status of an overall component versus a unit was that we can see the component itself is healthy but the unit that is running is not.
I think in practice most users just review the component health and don't look at individual units. I think taking an aggregated approach of make the unit status reflect the component status does make sense.
I am +1 for this type of change, if done correctly. I don't want to lose the context of the component health, so maybe we need to have two status levels for a component. The overall health of the component (including the aggregation of the units) and then a single health state for the components communication with the Elastic Agent.
I think in practice most users just review the component health and don't look at individual units. I think taking an aggregated approach of make the unit status reflect the component status does make sense.
From a user perspective I agree. The only case that worries me is the upgrade watcher:
If we made this change today, and had a single failed unit set the component state to failed, agent would begin rolling back upgrades because of unit level errors. We need to decide if this is behavior we want. My preference is to leave this unchanged so we ignore unit errors when deciding to roll back.
Describe the enhancement:
Describe a specific use case for the enhancement or feature:
beats-monitoring
component is Healthy (state 2), but the monitoring unit is actually in a degraded state.What is the definition of done?