cockroachdb / cockroach

CockroachDB — the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placement.
https://www.cockroachlabs.com
Other
29.95k stars 3.79k forks source link

c2c: managed incremental tenant fingerprint #89357

Open stevendanna opened 1 year ago

stevendanna commented 1 year ago

The tenant checksumming tool being built in https://github.com/cockroachdb/cockroach/issues/89355 requires that the user manually run the checksum command on both primary and standby clusters. Further, it requires that they do so at a time point where the entire time range they would like to checkpoint is available on both the primary and standby.

Cluster to cluster streaming could instead finger both clusters as the streaming frontier advances. On the standby (receiving cluster), another process would do the following:

For large, existing tenants, calculating the initial checksum t_0 may still take considerable time. Some mitigations we might consider for this:

Jira issue: CRDB-20215

Epic CRDB-18750

blathers-crl[bot] commented 1 year ago

cc @cockroachdb/disaster-recovery