mozilla / bigquery-etl

Bigquery ETL
https://mozilla.github.io/bigquery-etl
Mozilla Public License 2.0
244 stars 98 forks source link

add scheduling definition into parameter to fix error where data not … #5716

Closed Marlene-M-Hirose closed 1 month ago

Marlene-M-Hirose commented 1 month ago

…going into the proper partition

Checklist for reviewer:

For modifications to schemas in restricted namespaces (see CODEOWNERS):

┆Issue is synchronized with this Jira Task

dataops-ci-bot commented 1 month ago

Integration report for "add scheduling definition into parameter to fix error where data not going into the proper partition"

sql.diff

Click to expand! ```diff diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/dags/bqetl_desktop_retention_model.py /tmp/workspace/generated-sql/dags/bqetl_desktop_retention_model.py --- /tmp/workspace/main-generated-sql/dags/bqetl_desktop_retention_model.py 2024-06-03 17:17:58.000000000 +0000 +++ /tmp/workspace/generated-sql/dags/bqetl_desktop_retention_model.py 2024-06-03 17:18:46.000000000 +0000 @@ -90,13 +90,15 @@ telemetry_derived__desktop_retention__v1 = bigquery_etl_query( task_id="telemetry_derived__desktop_retention__v1", - destination_table="desktop_retention_v1", + destination_table='desktop_retention_v1${{ macros.ds_format(macros.ds_add(ds, -27), "%Y-%m-%d", "%Y%m%d") }}', dataset_id="telemetry_derived", project_id="moz-fx-data-shared-prod", owner="mhirose@mozilla.com", email=["mhirose@mozilla.com", "telemetry-alerts@mozilla.com"], - date_partition_parameter="submission_date", + date_partition_parameter=None, depends_on_past=False, + parameters=["metric_date:DATE:{{macros.ds_add(ds, -27)}}"] + + ["submission_date:DATE:{% raw %}{{ds}}{% endraw %}"], ) telemetry_derived__desktop_retention_clients__v1 = bigquery_etl_query( diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/telemetry_derived/desktop_retention_v1/metadata.yaml /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/telemetry_derived/desktop_retention_v1/metadata.yaml --- /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/telemetry_derived/desktop_retention_v1/metadata.yaml 2024-06-03 17:15:43.000000000 +0000 +++ /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/telemetry_derived/desktop_retention_v1/metadata.yaml 2024-06-03 17:15:56.000000000 +0000 @@ -9,6 +9,10 @@ dag: bqetl_desktop_retention_model scheduling: dag_name: bqetl_desktop_retention_model + date_partition_parameter: metric_date + date_partition_offset: -27 + parameters: + - submission_date:DATE:{% raw %}{{ds}}{% endraw %} bigquery: time_partitioning: type: day ```

Link to full diff

dataops-ci-bot commented 1 month ago

Integration report for "remove {raw} around date as not using sql generator here"

sql.diff

Click to expand! ```diff diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/dags/bqetl_desktop_retention_model.py /tmp/workspace/generated-sql/dags/bqetl_desktop_retention_model.py --- /tmp/workspace/main-generated-sql/dags/bqetl_desktop_retention_model.py 2024-06-03 17:27:26.000000000 +0000 +++ /tmp/workspace/generated-sql/dags/bqetl_desktop_retention_model.py 2024-06-03 17:28:34.000000000 +0000 @@ -90,13 +90,15 @@ telemetry_derived__desktop_retention__v1 = bigquery_etl_query( task_id="telemetry_derived__desktop_retention__v1", - destination_table="desktop_retention_v1", + destination_table='desktop_retention_v1${{ macros.ds_format(macros.ds_add(ds, -27), "%Y-%m-%d", "%Y%m%d") }}', dataset_id="telemetry_derived", project_id="moz-fx-data-shared-prod", owner="mhirose@mozilla.com", email=["mhirose@mozilla.com", "telemetry-alerts@mozilla.com"], - date_partition_parameter="submission_date", + date_partition_parameter=None, depends_on_past=False, + parameters=["metric_date:DATE:{{macros.ds_add(ds, -27)}}"] + + ["submission_date:DATE:{{ds}}"], ) telemetry_derived__desktop_retention_clients__v1 = bigquery_etl_query( diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/telemetry_derived/desktop_retention_v1/metadata.yaml /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/telemetry_derived/desktop_retention_v1/metadata.yaml --- /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/telemetry_derived/desktop_retention_v1/metadata.yaml 2024-06-03 17:25:37.000000000 +0000 +++ /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/telemetry_derived/desktop_retention_v1/metadata.yaml 2024-06-03 17:25:42.000000000 +0000 @@ -9,6 +9,10 @@ dag: bqetl_desktop_retention_model scheduling: dag_name: bqetl_desktop_retention_model + date_partition_parameter: metric_date + date_partition_offset: -27 + parameters: + - submission_date:DATE:{{ds}} bigquery: time_partitioning: type: day ```

Link to full diff