kvserver: rebalance based on write load and store health

cockroachdb / cockroach

CockroachDB — the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placement.

https://www.cockroachlabs.com

Other

30.04k stars 3.8k forks source link

kvserver: rebalance based on write load and store health #79216

Open erikgrinaker opened 2 years ago

erikgrinaker commented 2 years ago

The store rebalancer currently does not take into account replica/store write load or store health when making rebalancing decisions. This can lead to write hotspots that disrupt the entire cluster, and yet the rebalancer won't currently take action to resolve it. It needs to.

Related to #62168, #77604, #69364, #70911, #73714, and probably others.

Jira issue: CRDB-14643

github-actions[bot] commented 1 year ago

We have marked this issue as stale because it has been inactive for 18 months. If this issue is still relevant, removing the stale label or adding a comment will keep it active. Otherwise, we'll close it in 10 days to keep the issue queue tidy. Thank you for your contribution to CockroachDB!

erikgrinaker commented 1 year ago

@kvoli Is this still relevant?

kvoli commented 1 year ago

Still relevant. Write load is not accounted for directly. Store health (LSM) is only blocking rebalancing, not triggering.