BCDevOps / platform-services

Collection of platform related tools and configurations
Apache License 2.0
13 stars 29 forks source link

Update Aporeto Custom Operator to Shutdown Platform Apps during Aporeto Downtime #611

Open mitovskaol opened 4 years ago

mitovskaol commented 4 years ago

As a member of the Platform Support team I would like to be able to invoke the Aporeto Custom Operator as part of the Aporeto Downtime Remediation process to shut down all apps on the Platform that have not explicitly opted out of the Remediation process.

As a product owner of a Platform application I would like to be able to mark my application as "opted out" of the Aporeto Downtime Remediation process so that it continue running when the Aporeto service is down on the Platform accepting the risk of leaving the app "unprotected" by Aporeto. One way to mark an application as "opted out" would be to add a new Custom Resource Definition to add that would contain the "opt-out" flag. The Custom Operator will detect the CR and will spare the application during the Platform app shutdown.

jleach commented 4 years ago

@mitovskaol How about we make it an opt-in process. If its opt-out and people forget (or don't understand the impact) we may have more "things" to stop/start if Aporeto is down than we expect. For example, we have almost 4000 pods running and its not trivial to restart several thousand on demand.