mozilla / bigquery-etl

Bigquery ETL
https://mozilla.github.io/bigquery-etl
Mozilla Public License 2.0
241 stars 98 forks source link

Add additional fields to data observability table to test schema changes #5735

Closed scholtzan closed 1 month ago

scholtzan commented 1 month ago

Checklist for reviewer:

For modifications to schemas in restricted namespaces (see CODEOWNERS):

┆Issue is synchronized with this Jira Task

dataops-ci-bot commented 1 month ago

Integration report for "Add additional fields to data observability table to test schema changes"

sql.diff

Click to expand! ```diff diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/dags/bqetl_unified.py /tmp/workspace/generated-sql/dags/bqetl_unified.py --- /tmp/workspace/main-generated-sql/dags/bqetl_unified.py 2024-06-04 18:59:04.000000000 +0000 +++ /tmp/workspace/generated-sql/dags/bqetl_unified.py 2024-06-04 19:00:16.000000000 +0000 @@ -188,6 +188,13 @@ execution_date="{{ (execution_date - macros.timedelta(seconds=10800)).isoformat() }}", ) + ExternalTaskMarker( + task_id="kpi_forecasting__wait_for_unified_metrics", + external_dag_id="kpi_forecasting", + external_task_id="wait_for_unified_metrics", + execution_date="{{ (execution_date + macros.timedelta(seconds=3600)).isoformat() }}", + ) + checks__fail_telemetry_derived__unified_metrics__v1_external.set_upstream( checks__fail_telemetry_derived__unified_metrics__v1 ) @@ -210,6 +217,20 @@ retries=0, ) + with TaskGroup( + "checks__warn_telemetry_derived__unified_metrics__v1_external", + ) as checks__warn_telemetry_derived__unified_metrics__v1_external: + ExternalTaskMarker( + task_id="kpi_forecasting__wait_for_unified_metrics", + external_dag_id="kpi_forecasting", + external_task_id="wait_for_unified_metrics", + execution_date="{{ (execution_date + macros.timedelta(seconds=3600)).isoformat() }}", + ) + + checks__warn_telemetry_derived__unified_metrics__v1_external.set_upstream( + checks__warn_telemetry_derived__unified_metrics__v1 + ) + telemetry_derived__rolling_cohorts__v1 = bigquery_etl_query( task_id="telemetry_derived__rolling_cohorts__v1", destination_table="rolling_cohorts_v1", @@ -257,6 +278,20 @@ depends_on_past=False, ) + with TaskGroup( + "telemetry_derived__unified_metrics__v1_external", + ) as telemetry_derived__unified_metrics__v1_external: + ExternalTaskMarker( + task_id="kpi_forecasting__wait_for_unified_metrics", + external_dag_id="kpi_forecasting", + external_task_id="wait_for_unified_metrics", + execution_date="{{ (execution_date + macros.timedelta(seconds=3600)).isoformat() }}", + ) + + telemetry_derived__unified_metrics__v1_external.set_upstream( + telemetry_derived__unified_metrics__v1 + ) + checks__fail_telemetry_derived__unified_metrics__v1.set_upstream( telemetry_derived__unified_metrics__v1 ) diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/sql/data-observability-dev/fenix_derived/firefox_android_clients_v1/query.sql /tmp/workspace/generated-sql/sql/data-observability-dev/fenix_derived/firefox_android_clients_v1/query.sql --- /tmp/workspace/main-generated-sql/sql/data-observability-dev/fenix_derived/firefox_android_clients_v1/query.sql 2024-06-04 18:55:28.000000000 +0000 +++ /tmp/workspace/generated-sql/sql/data-observability-dev/fenix_derived/firefox_android_clients_v1/query.sql 2024-06-04 18:55:38.000000000 +0000 @@ -1,4 +1,9 @@ SELECT - * + *, + -- add some additional fields to test schema changes + CAST(NULL AS STRING) AS additional_field_1, + CAST(NULL AS INT64) AS additional_field_2, + CAST(NULL AS STRING) AS additional_field_3, + CAST(NULL AS STRING) AS additional_field_4, FROM `moz-fx-data-shared-prod.fenix_derived.firefox_android_clients_v1` diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/telemetry_derived/unified_metrics_v1/metadata.yaml /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/telemetry_derived/unified_metrics_v1/metadata.yaml --- /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/telemetry_derived/unified_metrics_v1/metadata.yaml 2024-06-04 18:57:10.000000000 +0000 +++ /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/telemetry_derived/unified_metrics_v1/metadata.yaml 2024-06-04 18:57:21.000000000 +0000 @@ -19,6 +19,10 @@ owner1: loines scheduling: dag_name: bqetl_unified + external_downstream_tasks: + - task_id: wait_for_unified_metrics + dag_name: kpi_forecasting + execution_delta: 1h bigquery: time_partitioning: type: day ```

Link to full diff

dataops-ci-bot commented 1 month ago

Integration report for "Merge branch 'main' into data-observability-schema-change"

sql.diff

Click to expand! ```diff diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/sql/data-observability-dev/fenix_derived/firefox_android_clients_v1/query.sql /tmp/workspace/generated-sql/sql/data-observability-dev/fenix_derived/firefox_android_clients_v1/query.sql --- /tmp/workspace/main-generated-sql/sql/data-observability-dev/fenix_derived/firefox_android_clients_v1/query.sql 2024-06-04 18:59:42.000000000 +0000 +++ /tmp/workspace/generated-sql/sql/data-observability-dev/fenix_derived/firefox_android_clients_v1/query.sql 2024-06-04 18:59:43.000000000 +0000 @@ -1,4 +1,9 @@ SELECT - * + *, + -- add some additional fields to test schema changes + CAST(NULL AS STRING) AS additional_field_1, + CAST(NULL AS INT64) AS additional_field_2, + CAST(NULL AS STRING) AS additional_field_3, + CAST(NULL AS STRING) AS additional_field_4, FROM `moz-fx-data-shared-prod.fenix_derived.firefox_android_clients_v1` ```

Link to full diff