cloud-gov / cg-atlas

Repository hosting issues and artifacts related to operations of the cloud.gov platform
Creative Commons Zero v1.0 Universal

Practice and document our influxdb restore process #180

Closed cnelson closed 7 years ago

cnelson commented 7 years ago

In order to be able to quickly restore from backups, as an SRE I want a tested, documented process for restoring influxdb from our S3 backups.

Acceptance Criteria

I would like to see us test two scenarios that would require restoring from backup.

Normal recovery

We replace the data in our existing influxdb deployment with the data from our backups.

IR recovery

We can stand up a new influxdb deployment next to our existing influxdb deployment, with data loaded from a specific point in time.

If our influxdb system is breached, we may need to preserve it in its current state for forensic purposes. We would then need to spin up a new influxdb deployment concurrently, so that we can compare data from known-good backups to what is in the compromised deployment.
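For the IR case, one step that could be scripted is choosing which backup to load for a given point in time. The following is only a sketch of that selection logic, not our actual tooling. It assumes the backup object keys embed a UTC timestamp in a layout like `influxdb/backup-YYYYMMDDTHHMMSSZ.tar.gz`; that layout, and the names `backup_time` and `pick_backup`, are hypothetical and would need to be adapted to whatever our backup job really writes. Listing the bucket and running the restore itself are left out.

```python
from datetime import datetime, timezone
from typing import Iterable, Optional

# Hypothetical key layout (an assumption, not our real naming scheme):
#   influxdb/backup-YYYYMMDDTHHMMSSZ.tar.gz
KEY_PREFIX = "influxdb/backup-"
KEY_SUFFIX = ".tar.gz"
STAMP_FORMAT = "%Y%m%dT%H%M%SZ"


def backup_time(key: str) -> Optional[datetime]:
    """Return the UTC time encoded in a backup key, or None if the key
    does not match the assumed layout."""
    if not (key.startswith(KEY_PREFIX) and key.endswith(KEY_SUFFIX)):
        return None
    stamp = key[len(KEY_PREFIX):-len(KEY_SUFFIX)]
    try:
        parsed = datetime.strptime(stamp, STAMP_FORMAT)
    except ValueError:
        return None
    return parsed.replace(tzinfo=timezone.utc)


def pick_backup(keys: Iterable[str], target: datetime) -> Optional[str]:
    """Return the key of the newest backup taken at or before `target`.

    `target` must be timezone-aware. Returns None when no backup is old
    enough, so the caller can fail loudly instead of restoring data from
    after the point in time being investigated.
    """
    candidates = []
    for key in keys:
        taken = backup_time(key)
        if taken is not None and taken <= target:
            candidates.append((taken, key))
    if not candidates:
        return None
    return max(candidates)[1]
```

In a breach scenario, `target` would be the last moment we believe the data was known-good, and the returned key is the backup to load into the new deployment that runs next to the preserved one.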