Replica status of local instance stuck in the down state.
During instance startup the box.info is returning box.info.replication[N].upsream.status with a connected value for all replicas mentioned in the box.cfg.replication, including the identifier of the instance itself (box.info.id). If we try to export metrics at this point of time, we get 0 value (down) as a replication status for all upstreams. The section upstream will be missing in box.info.replication for self instance after startup. Thus we will continue to observe down replica status for upstream with a N id.
Hi,
Replica status of local instance stuck in the down state.
During instance startup the box.info is returning box.info.replication[N].upsream.status with a connected value for all replicas mentioned in the box.cfg.replication, including the identifier of the instance itself (box.info.id). If we try to export metrics at this point of time, we get 0 value (down) as a replication status for all upstreams. The section upstream will be missing in box.info.replication for self instance after startup. Thus we will continue to observe down replica status for upstream with a N id.
It might be worth adding the following condition
k ~= info.id
to the uptsream section handler. https://github.com/tarantool/metrics/blob/master/metrics/tarantool/info.lua#L34Please check following examples to get more information about the issue: box_info_after_startup.txt box_info_during_startup.txt tnt_replication_status_after_startup.txt