datamade / bga-payroll

💰 How much do your public officials make?

Migrate databases to RDS #538

Open hancush opened 2 years ago

hancush commented 2 years ago

We're currently using containerized Postgres 10 instances on the staging and production servers to host the data. I'd like to migrate to RDS for several reasons:

  1. RDS is more powerful, easier to configure, and easier to monitor than our containerized Postgres instances.
  2. Automatic backups are important, since ETL is time-consuming and not fully reproducible due to human data entry after upload.
  3. The data update process is pretty convoluted right now: I clone the live database, restore it on staging, and make the updates. Once fact checking is done, I clone the updated database and restore it to production. RDS will streamline this process.
  4. RDS offers burstable instances that will improve site performance under acute heavy load, e.g., at launch time. This has been an enormous pain point in the past.
  5. The data is now at the scale of millions of records. RDS provides flexible capacity, as well as a more intuitive UI for managing features like sharding that could be quite useful as the data continues to grow.
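
For context, the clone-and-restore cycle described in item 3 could be sketched roughly as follows. This is only an illustration, not the actual tooling in this repo: the connection URLs, the dump directory, and the function names are all hypothetical, and the sketch only builds the `pg_dump` / `pg_restore` command lines rather than running them.

```python
import shlex
from typing import List


def dump_command(source_url: str, dump_path: str) -> List[str]:
    """Build a pg_dump invocation that writes a custom-format archive."""
    return [
        "pg_dump",
        "--format=custom",
        f"--file={dump_path}",
        f"--dbname={source_url}",
    ]


def restore_command(target_url: str, dump_path: str) -> List[str]:
    """Build a pg_restore invocation that replaces objects in the target."""
    return [
        "pg_restore",
        "--clean",
        "--if-exists",
        "--no-owner",
        f"--dbname={target_url}",
        dump_path,
    ]


def update_cycle(prod_url: str, staging_url: str, dump_dir: str = "/tmp") -> List[List[str]]:
    """Return the four commands of the current update process, in order.

    The data updates and fact checking happen on staging between
    step 2 and step 3; they are manual and not modeled here.
    """
    live_dump = f"{dump_dir}/live.dump"        # hypothetical file name
    updated_dump = f"{dump_dir}/updated.dump"  # hypothetical file name
    return [
        dump_command(prod_url, live_dump),            # 1. clone the live database
        restore_command(staging_url, live_dump),      # 2. restore it on staging
        dump_command(staging_url, updated_dump),      # 3. clone the updated database
        restore_command(prod_url, updated_dump),      # 4. restore it to production
    ]


if __name__ == "__main__":
    # Placeholder URLs, assumed for illustration only.
    for cmd in update_cycle("postgresql://prod.example/payroll",
                            "postgresql://staging.example/payroll"):
        print(shlex.join(cmd))
```

Each update round-trips the full database twice, which is the overhead a managed service with built-in backups should help reduce.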

Migration plan: