GoogleCloudPlatform / vertex-pipelines-end-to-end-samples

Apache License 2.0
218 stars 85 forks source link

Feat/enable concurrent runs #51

Closed jan-zajac-dt closed 1 year ago

jan-zajac-dt commented 1 year ago

Description

DRAFT Adding an additional environment variable ADDITIONAL_SUFFIX so that GCS asset paths and BQ tables that get overwritten can be suffixed with it, permitting multiple developers on the same project to run pipelines without deleting each others' changes.

I have also tried using pipeline arguments in the compiler for the pipeline. In my opinion this is a worse solution as:

  1. It introduces a lot of additional args to be ran in the Makefile for compile-pipeline, sync-assets and run commands
  2. We already use environment variables for many of the pipeline arguments.

If approach is ok, I will extend to other pipelines (as draft only includes XGBoost training) and update documentation where needed.

How has this been tested?

2 successful pipelines that (with/without ADDITIONAL SUFFIX environment variable set )

Checklist

Pipeline run links:

google-cla[bot] commented 1 year ago

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.