Open sfc-gh-tclinkenbeard opened 3 years ago
If a cluster is running in three_data_hall mode, on three data halls, there is a case where losing a single zone will cause unavailability due to stuck recovery, even though in status, we see:
three_data_hall
$ fdbcli --exec 'status json' | jq .cluster.fault_tolerance.max_zone_failures_without_losing_availability 2
The redundancy mode should be used in the availability calculation, to avoid confusing status output.
Talk to @sfc-gh-etschannen before fixing this.
If a cluster is running in
three_data_hall
mode, on three data halls, there is a case where losing a single zone will cause unavailability due to stuck recovery, even though in status, we see:The redundancy mode should be used in the availability calculation, to avoid confusing status output.