Data movement can't begin on clusters missing data

apple / foundationdb

FoundationDB - the open source, distributed, transactional key-value store

Apache License 2.0

14.6k stars 1.32k forks source link

When a cluster loses all replicas of a shard and the data distributor later restarts, it gets stuck trying to track the initial shards (Note: I'm not certain if this is universally true or if it requires other properties to hold). As a result, no data movement can happen with the data that still exists in the cluster.

It would be better if data movement could continue on the shards that remain, which could help us to prevent increasing the blast radius of this failure case in some circumstances.

apple / foundationdb

Data movement can't begin on clusters missing data #3774