cds-snc / notification-planning-core

Project planning for GC Notify Core Team
0 stars 0 forks source link

Deploy new celery-sms-send pods in prod #195

Open sastels opened 1 year ago

sastels commented 1 year ago

Description

As a Notify team member, I need to change our prod sms configuration to be able to scale email and sms independantly.

WHY are we building?

Need to be able to scale emails and sms independantly.

WHAT are we building?

deploy the sms-send pods in prod

VALUE created by our solution

We will be able to keep both emails and sms close peak send rates close to but not over our AWS limits

Acceptance Criteria

QA Steps

sastels commented 1 year ago

Various tests as documented here Main points:

We tweaked the pod requirements in the hope that that would keep them spun up. This seemed to work on 5 pods.

sastels commented 1 year ago

promising results on ten pods with tweaked hpa. spun up 10 pods and kept them up. Some strange bumps at the start and end that we might be able to smooth out by lowering the rate limit down to 1/s (current 5/s)

Private Zenhub Image

sastels commented 1 year ago

Test

Configuration

Results

Private Zenhub Image

Private Zenhub Image

sastels commented 1 year ago

So I suggest that for V1 we ship the above configuration without karpenter and with a constant 7 nodes. After this we can

sastels commented 1 year ago

use sms send deployment on prod (well, after the next release) https://github.com/cds-snc/notification-manifests/pull/2041

sastels commented 1 year ago

merged to staging!

sastels commented 1 year ago

Test of 40K emails, 60K sms on staging Results as expected:

Private Zenhub Image

Private Zenhub Image

sastels commented 1 year ago

A double 80K / 12K test got similar results.

jimleroyer commented 1 year ago

First iteration will go in production today with a sending capacity of around 3,300 for emails. Steve will focus on the deployment and test for today.

sastels commented 1 year ago

Release went pretty well. Two issues:

Email issue fixed, sms to investigate...

ben851 commented 1 year ago

Ben to QA

ben851 commented 12 months ago

Verified that the sms sends are in the sms-send-high and medium queues. Verified that virtually all sms are being sent by the celery-sms-send pods (there are a few that aren't; these are notify admin 2fa) Verified over the last 3 days that peak email send rates were approx 2000 per minute Checked SMS Send over the past week, and it peaked around 300. I'm assuming this is due to lack of load. Will confirm w/ Steve at standup