slalom / dataops-infra

Slalom Infrastructure Catalog for DataOps deployments
https://infra.dataops.tk
MIT License
19 stars 35 forks source link

SpeedRun #1: Simple Extract-Load Pipeline #55

Open aaronsteers opened 4 years ago

aaronsteers commented 4 years ago

As a training tool, as a test for ease-of-use, and as proof of value, we're creating a "speed run" video that demonstrates how to get up and running quickly with the Infrastructure Catalog and a basic DataOps pipeline. This will uncover usability issues and bugs which we'll need to resolve before we can promote the platform broadly.


Stop Point:

Start Point:


Speed Target: 12 minutes


Other Details:

Blockers:


Steps:

Create Repo and AWS Account (0:00-2:00, approx. 2m):

Configure Creds (2:00-3:30, approx. 1.5m):

Configure Project (3:30-4:00, approx 0.5m):

Configure Extracts (4:00-6:00, approx. 2m):

Configure and Deploy Terraform (6:00-10:30, approx. 4.5m):

Run a Sync Test (10:30-14:30, approx. 4m):

aaronsteers commented 4 years ago

Speed-run test log:

aaronsteers commented 4 years ago

Results for Run #3 on 2/28 - 36 minutes!:

Update: debugging the failure:

aaronsteers commented 4 years ago

Video recording tool ideas from peers:

aaronsteers commented 4 years ago

Results for Run #4 3/2 - 22 minutes!

Other learnings:

Example:

module "singer-taps" {
  # ...
  taps = [{
    id = "pardot"
    settings = {}
    secrets = {
      email    = "file://${secrets_filepath}:email"
      password = "file://${secrets_filepath}:password"
      user_key = "file://${secrets_filepath}:user_key"
    }
  }]
  # ...