k8ssandra / k8ssandra-terraform

Apache License 2.0
Instance Sizing #12

Open chrislovecnm opened 3 years ago

chrislovecnm commented 3 years ago

An open issue for documenting GKE instance sizing

┆Issue is synchronized with this Jira Task by Unito ┆friendlyId: K8SSAND-175 ┆priority: Medium

bradfordcp commented 3 years ago

@adejanovski can you provide some guidance here in line with what's being used for performance testing?

IIRC we were looking at 8-16 vCPUs (leaning towards 16 here) and 64 GB RAM.

adejanovski commented 3 years ago

Currently we're using e2-highmem-8 (8 vCPUs and 64 GB RAM) for perf testing. e2-highmem-16 (16 vCPUs and 128 GB RAM) is also a good fit for larger workloads or higher density.
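For anyone turning the numbers above into a sizing rule, here is a minimal sketch. The specs are the ones quoted in this thread; the `pick_machine_type` helper and the `E2_HIGHMEM` table are hypothetical names for illustration, not part of k8ssandra-terraform.

```python
# Specs as quoted in this thread. The table and helper are illustrative only.
E2_HIGHMEM = {
    "e2-highmem-8": {"vcpus": 8, "ram_gb": 64},
    "e2-highmem-16": {"vcpus": 16, "ram_gb": 128},
}


def pick_machine_type(min_vcpus, min_ram_gb):
    """Return the smallest listed type meeting both minimums, or None."""
    candidates = [
        (spec["vcpus"], name)
        for name, spec in E2_HIGHMEM.items()
        if spec["vcpus"] >= min_vcpus and spec["ram_gb"] >= min_ram_gb
    ]
    # Tuples sort by vCPU count first, so min() picks the smallest fit.
    return min(candidates)[1] if candidates else None


print(pick_machine_type(8, 64))
print(pick_machine_type(16, 64))
```

With the 8-16 vCPU / 64 GB range mentioned earlier, the low end maps to e2-highmem-8 and the high end to e2-highmem-16.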

General recommendations for running Cassandra in production:

bradfordcp commented 3 years ago

With zero-copy streaming in 4.0 (and 10 GbE), I'm curious what node density would match the time it currently takes to stream 1-2 TB on 3.11.
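As a rough frame for that question, the network alone sets a floor on streaming time. The sketch below assumes decimal terabytes and a link running at full line rate, and it ignores protocol overhead, disk throughput, and any work Cassandra does while streaming, so real times will be longer. The function name `min_stream_seconds` is made up for this example.

```python
def min_stream_seconds(data_tb, link_gbit_per_s=10.0):
    """Lower bound on transfer time for data_tb decimal terabytes.

    Assumes the link is saturated at its nominal rate; everything else
    (overhead, disk, CPU) is ignored.
    """
    bits = data_tb * 1e12 * 8
    return bits / (link_gbit_per_s * 1e9)


for tb in (1, 2):
    secs = min_stream_seconds(tb)
    print(f"{tb} TB at 10 Gbit/s: at least {secs / 60:.1f} min")
```

Under these assumptions the floor is about 13 minutes per TB on 10 GbE, so any measured gap between 3.11 and 4.0 beyond that comes from the nodes rather than the wire.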

adejanovski commented 3 years ago

Zero-copy streaming as it was implemented for 4.0 only works for tables using LCS and requires that there aren't too many vnodes (if any at all), which reduces the benefits we can expect from it. Compaction and repair improvements, though, should help with handling higher densities. Even without zero-copy streaming, I guess the lighter memory footprint of 4.0 should help reduce GC during streaming and make it faster.

It could be interesting to benchmark this specific aspect and compare time to recovery between 4.0 and 3.11.