Closed gregglind closed 9 years ago
For the record,
MOAR POWER. A few more VM's or such for the first month, until it all settles down. I like this, because I HAVE TO DO NO WORK.
Will likely result in another spike 30 days later when the caches run out. I don't suggest doing this unless the idea is to add a few more VMs permanently.
(I agree there will be "rhyming" spikes at 30, 60. They should descrease in amplitude. This patch does the slow rollout over 30 days approach.
I opened issue 129 to deal with the "rhyming" spikes issue. If a rhyming spike causes an outage, it will increase the amplitude of future spikes, rather than decreasing it. We need to random fuzz the '30 day' interval to prevent this. (Services Engineering has prior experience doing this with Sync clients and servers, for similar reasons.)
I guess I'm too late, but I went one step farther than @willkg, and plotted the numbers: http://nbviewer.ipython.org/gist/mythmon/10584f8d2c60b05d3627
This probably isn't quite what you had in mind, as it still has a large group of people updating together. I played around with some other probability distributions, but I couldn't get anything that was flat, like we would want.
Let's get Country Codes into Heartbeat without WRECKING EVERYTHING. (And solve one of glind's Q3 goals! Heros of Heartbeat Medals for Everyone).
Some things:
Current status:
Proposals to get over the day one hump (more welcome!):
Asks:
Offers:
Thanks!
Gregg Lind User Advocacy Self-Repair Lead