OpenRiak / riak_kv

Riak Key/Value Store
0 stars 0 forks source link

Nhse d30 kv1869 #2

Closed martinsumner closed 1 year ago

martinsumner commented 1 year ago

Rather than wait for the next read (which may never happen) to repair - try and defer, as the reaper will not process until the failure has cleared.

https://github.com/nhs-riak/riak_test/pull/1

https://github.com/basho/riak_kv/issues/1869

martinsumner commented 1 year ago

The test was passing due to good fortune in timing. There is a fundamental issue with this PR.

When the node comes back up, the primaries will be reinstated and the reaps will occur. However, handoffs may not have occurred. So the reaps can complete, the clusters can appear to be in-sync ... but once handoffs complete the tombstones re-appear and full-sync must still resolve the issue.

There is no obvious was of deferring reaps until after handoffs, so there is no way forward for this solution.