opensearch-project / cross-cluster-replication

Synchronize your data across multiple clusters for lower latencies and higher availability
https://opensearch.org/docs/latest/replication-plugin/index/
Apache License 2.0
47 stars 58 forks source link

[BUG] Replication Failover Gap #1349

Open zalseryani opened 6 months ago

zalseryani commented 6 months ago

Replication Failover on Production Outage has Data Gap

  • because I do not see any replication configuration to tune the speed of replication or the pulling interval for the data (not metadata/settings or new matching indices when having an auto-follow rule configured between Prod and DR sites) Replication settings

Kindly advise, and thanks in advance for your time and support.

ankitkala commented 5 months ago

We do not support synchronous replication.

During DR, follower stats can give you the last tracked leaderCheckpoint & followerCheckpoint but 1) it tracks changes at shard level whereas user is concerned about REST API level. 2) Checkpoint doesn't tell you the total data replicated in terms on time but rather as a monotonically increasing integer value.

CCR provides 1 min SLA for replication and usually is under 20 seconds. But its hard to guarantee this as a lot depends on the workload and overall resource consumption.