Closed tigrannajaryan closed 1 year ago
@andykellr @PeterF778 what do you think?
I agree that this is unclear in the spec. I think healthy
is a better name. I think an agent that is down is also unhealthy so I do not think we currently need another field to represent running/not-running.
I agree that this is unclear in the spec. I think
healthy
is a better name. I think an agent that is down is also unhealthy so I do not think we currently need another field to represent running/not-running.
What do we do with start_time_unix_nano
in that case? It is said to be set when up
is true. Should we untie these 2 fields?
The AgentHealth currently has an
up
field and alast_error
fields.It is not clear how to set fields if the agent process is started and running but it is unhealthy (e.g. we have a way to verify its health by polling a health check endpoint). Should we set
up
to true or false in this case?The
up
field definition isSo, it seems like we should set it to true. However, there is no other explicitly defined way to indicate unhealthiness, unless we assume the presence of
last_error
is that indicator.We need to either clarify the spec to say
last_error
is the indicator or add another field to indicate the unhealthiness (e.g.bool healthy
), or maybe renameup
tohealthy
?