Drop Bucket or Drop Key Range (perhaps Drop Modified Range)

A facility to drop a bucket, or drop a key range in a bucket, or drop a modified date range in a bucket.

The outline idea here is that a drop request would be a special type of key change. The change would be written to the journal, and then the drop would be appended as metadata to the Bookie's ledger cache, and then forwarded to the penciller. The penciller would append the drop request to its cache, and then request an update to the clerk.

The clerk would inform all SST files in the range of the drop of the drop, and then update the manifest so that the entry for each SST file has an indication of the drop.

Requests to the ledger cache will be filtered based on the drop (for the duration of that cache). SST files would also filter out all data they returned based on the drop.

As caches are merged into the LSM tree, new caches need not be aware of the drop. As new SST files are created they need not be aware of the drop. Eventually memory of the drop is forgotten, but as long as a file or cache contained information about the drop, until the file is deleted by a merge event it remembers the drop and filters any result it outputs.

On restart, the manifest remembers the drops, and informs the leveled_sst actor as the file is started. Where new caches are built from the Journal, the persistence of the drop in the Journal will re-apply the drop at the same point.

There are issues that need to be resolved. Most notably the problem of what happens if the pclerk does not get the drop message before a shutdown, and general race conditions between the pclerk and the penciller.

There is going to be a significant amount of change. However, having a drop may make life easier - and may stop people from using features that have their own overheads (TTL objects, multi-backend). For the Riak implementation, the drop process will need a broader safe process e.g.:

First confirm all primaries are on line
Signal to all primaries to refuse PUTs for a time-limited period for a given range
Send the drop signal through to all backends
If everything responds, remove the PUT refusal and signal back OK

martinsumner / leveled

Drop Bucket or Drop Key Range (perhaps Drop Modified Range) #235