Open chrislovecnm opened 3 years ago
@adejanovski can you provide some guidance here in line with what's being used for performance testing?
IIRC we were looking at 8-16 vCPUs (leaning towards 16 here) and 64 GB RAM.
Currently we're using e2-highmem-8 (8 vCPUs and 64GB RAM) for perf testing. e2-highmem-16 (16 vCPUs and 128GB RAM) are also a good fit for larger workloads/density.
General recommendations for running Cassandra in production:
With zero copy streaming in 4.0 (and 10 GbE) I'm curious what the density looks like to match the current time to stream 1-2 TB on 3.11.
Zero copy streaming as it was implemented for 4.0 only works for tables using LCS and requires that there aren't too many vnodes (if at all), which reduces the benefits we can expect from it. Compaction and repair improvements though should be helpful to handle higher densities. Even without Zero Copy Streaming, I guess the lighter memory footprint of 4.0 should help reducing GC during streaming and make it faster.
It could be interesting to benchmark this specific aspect to compare 4.0 to 3.11 time to recovery.
An open issue for documenting GKE instance sizing
┆Issue is synchronized with this Jira Task by Unito ┆friendlyId: K8SSAND-175 ┆priority: Medium