GoogleCloudPlatform / vertex-pipelines-end-to-end-samples

Apache License 2.0

Relax TFDV schema #35

Closed felix-datatonic closed 1 year ago

felix-datatonic commented 1 year ago

Description

Remove the domain for the company column from the TFDV schema for the public dataset. With the domain in place, the example pipelines fail whenever the public dataset changes and this column contains values outside the domain. Removing it relaxes the data validation rules so that a freshly cloned repository runs successfully even if the contents of this column change. As a result, fewer (or no) manual schema changes should be needed in the future.
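
To illustrate the effect of dropping a domain, here is a minimal, library-free sketch. It does not use TFDV's API or the repository's actual schema files. The dict-based `Schema` type, the helpers `find_anomalies` and `relax_domain`, and the company names are all hypothetical and exist only for this example.

```python
from typing import Dict, Iterable, List, Optional, Set

# Hypothetical, simplified stand-in for a schema: column name -> allowed values,
# or None when the column has no domain constraint.
Schema = Dict[str, Optional[Set[str]]]


def find_anomalies(schema: Schema, rows: Iterable[Dict[str, str]]) -> List[str]:
    """Return one message per missing column or out-of-domain value."""
    anomalies = []
    for i, row in enumerate(rows):
        for column, domain in schema.items():
            if column not in row:
                anomalies.append(f"row {i}: missing column {column!r}")
            elif domain is not None and row[column] not in domain:
                anomalies.append(
                    f"row {i}: unexpected {column!r} value {row[column]!r}"
                )
    return anomalies


def relax_domain(schema: Schema, column: str) -> Schema:
    """Return a copy of the schema with the domain dropped for one column."""
    relaxed = dict(schema)
    relaxed[column] = None
    return relaxed


# Made-up values: the second row simulates the public dataset gaining a new company.
strict_schema: Schema = {"company": {"Acme Cabs", "Blue Taxi"}}
rows = [{"company": "Acme Cabs"}, {"company": "New Operator"}]

relaxed_schema = relax_domain(strict_schema, "company")
strict_anomalies = find_anomalies(strict_schema, rows)    # flags "New Operator"
relaxed_anomalies = find_anomalies(relaxed_schema, rows)  # no anomalies
```

The relaxed schema still requires the column to be present but no longer rejects unseen values. This is the trade-off described above: less strict validation in exchange for pipelines that keep running when the dataset changes. With TFDV itself, the corresponding edit would presumably clear the domain on the feature returned by `tfdv.get_feature(schema, 'company')`, or delete the domain entry from the schema file directly.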

How has this been tested?

Successfully ran the validation components for all example pipelines.

Checklist

Pipeline run links:

Will add if requested.