mozilla / bigquery-etl

Bigquery ETL
https://mozilla.github.io/bigquery-etl
Mozilla Public License 2.0
241 stars 98 forks source link

Backfill geckoview_version_v1 #5738

Closed edugfilho closed 1 month ago

edugfilho commented 1 month ago

Checklist for reviewer:

For modifications to schemas in restricted namespaces (see CODEOWNERS):

┆Issue is synchronized with this Jira Task

dataops-ci-bot commented 1 month ago

Integration report for "Backfill geckoview_version_v1"

sql.diff

Click to expand! ```diff Only in /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1: backfill.yaml diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/dataset_metadata.yaml /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/dataset_metadata.yaml --- /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/dataset_metadata.yaml 2024-06-04 22:35:24.000000000 +0000 +++ /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/dataset_metadata.yaml 2024-06-04 22:34:41.000000000 +0000 @@ -1,15 +1,10 @@ -friendly_name: org-mozilla-fenix Derived +friendly_name: Org Mozilla Fenix Derived description: |- - Derived tables related to document namespace org-mozilla-fenix, usually populated via queries defined in https://github.com/mozilla/bigquery-etl and managed by Airflow + Derived tables related to document namespace org-mozilla-fenix. dataset_base_acl: derived user_facing: false labels: {} -default_table_workgroup_access: -- role: roles/bigquery.dataViewer - members: - - workgroup:mozilla-confidential workgroup_access: - role: roles/bigquery.dataViewer members: - workgroup:mozilla-confidential -syndication: {} diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1/backfill.yaml /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1/backfill.yaml --- /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1/backfill.yaml 1970-01-01 00:00:00.000000000 +0000 +++ /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1/backfill.yaml 2024-06-04 22:34:41.000000000 +0000 @@ -0,0 +1,8 @@ +2024-06-04: + start_date: 2024-02-01 + end_date: 2024-06-03 + reason: Fix erroneous geckoview_major_version values since Feb 2024 + (https://github.com/mozilla/glam/issues/2843#issuecomment-2148079303) + watchers: + - efilho@mozilla.com + status: Initiate ```

Link to full diff

dataops-ci-bot commented 1 month ago

Integration report for "Add schema.yaml to geckoview_version_v1"

sql.diff

Click to expand! ```diff Only in /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1: backfill.yaml Only in /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1: schema.yaml diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/braze_external/changed_firefox_subscriptions_sync_v1/checks.sql /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/braze_external/changed_firefox_subscriptions_sync_v1/checks.sql --- /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/braze_external/changed_firefox_subscriptions_sync_v1/checks.sql 2024-06-04 23:41:33.000000000 +0000 +++ /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/braze_external/changed_firefox_subscriptions_sync_v1/checks.sql 2024-06-04 23:41:22.000000000 +0000 @@ -5,3 +5,6 @@ #warn {{ min_row_count(1) }} + +#warn +{{ is_unique(["EXTERNAL_ID"]) }} diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/braze_external/changed_newsletters_sync_v1/checks.sql /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/braze_external/changed_newsletters_sync_v1/checks.sql --- /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/braze_external/changed_newsletters_sync_v1/checks.sql 2024-06-04 23:41:33.000000000 +0000 +++ /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/braze_external/changed_newsletters_sync_v1/checks.sql 2024-06-04 23:41:22.000000000 +0000 @@ -5,3 +5,6 @@ #warn {{ min_row_count(1) }} + +#warn +{{ is_unique(["EXTERNAL_ID"]) }} diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/braze_external/changed_products_sync_v1/checks.sql /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/braze_external/changed_products_sync_v1/checks.sql --- /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/braze_external/changed_products_sync_v1/checks.sql 2024-06-04 23:41:33.000000000 +0000 +++ /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/braze_external/changed_products_sync_v1/checks.sql 2024-06-04 23:41:22.000000000 +0000 @@ -5,3 +5,6 @@ #warn {{ min_row_count(1) }} + +#warn +{{ is_unique(["EXTERNAL_ID"]) }} diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/braze_external/changed_users_sync_v1/checks.sql /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/braze_external/changed_users_sync_v1/checks.sql --- /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/braze_external/changed_users_sync_v1/checks.sql 2024-06-04 23:41:33.000000000 +0000 +++ /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/braze_external/changed_users_sync_v1/checks.sql 2024-06-04 23:41:22.000000000 +0000 @@ -5,3 +5,6 @@ #warn {{ min_row_count(1) }} + +#warn +{{ is_unique(["EXTERNAL_ID"]) }} diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/braze_external/changed_waitlists_sync_v1/checks.sql /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/braze_external/changed_waitlists_sync_v1/checks.sql --- /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/braze_external/changed_waitlists_sync_v1/checks.sql 2024-06-04 23:41:33.000000000 +0000 +++ /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/braze_external/changed_waitlists_sync_v1/checks.sql 2024-06-04 23:41:22.000000000 +0000 @@ -5,3 +5,6 @@ #warn {{ min_row_count(1) }} + +#warn +{{ is_unique(["EXTERNAL_ID"]) }} diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/dataset_metadata.yaml /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/dataset_metadata.yaml --- /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/dataset_metadata.yaml 2024-06-04 23:42:06.000000000 +0000 +++ /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/dataset_metadata.yaml 2024-06-04 23:41:22.000000000 +0000 @@ -1,15 +1,10 @@ -friendly_name: org-mozilla-fenix Derived +friendly_name: Org Mozilla Fenix Derived description: |- - Derived tables related to document namespace org-mozilla-fenix, usually populated via queries defined in https://github.com/mozilla/bigquery-etl and managed by Airflow + Derived tables related to document namespace org-mozilla-fenix. dataset_base_acl: derived user_facing: false labels: {} -default_table_workgroup_access: -- role: roles/bigquery.dataViewer - members: - - workgroup:mozilla-confidential workgroup_access: - role: roles/bigquery.dataViewer members: - workgroup:mozilla-confidential -syndication: {} diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1/backfill.yaml /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1/backfill.yaml --- /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1/backfill.yaml 1970-01-01 00:00:00.000000000 +0000 +++ /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1/backfill.yaml 2024-06-04 23:41:22.000000000 +0000 @@ -0,0 +1,8 @@ +2024-06-04: + start_date: 2024-02-01 + end_date: 2024-06-03 + reason: Fix erroneous geckoview_major_version values since Feb 2024 + (https://github.com/mozilla/glam/issues/2843#issuecomment-2148079303) + watchers: + - efilho@mozilla.com + status: Initiate diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1/schema.yaml /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1/schema.yaml --- /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1/schema.yaml 1970-01-01 00:00:00.000000000 +0000 +++ /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1/schema.yaml 2024-06-04 23:41:22.000000000 +0000 @@ -0,0 +1,13 @@ +fields: +- description: null + mode: NULLABLE + name: build_hour + type: DATETIME +- description: null + mode: NULLABLE + name: geckoview_major_version + type: INTEGER +- description: null + mode: NULLABLE + name: n_pings + type: INTEGER ```

Link to full diff

dataops-ci-bot commented 1 month ago

Integration report for "Merge branch 'main' into backfill-geckoview-version"

sql.diff

Click to expand! ```diff Only in /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1: backfill.yaml Only in /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1: schema.yaml diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/dataset_metadata.yaml /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/dataset_metadata.yaml --- /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/dataset_metadata.yaml 2024-06-04 23:43:22.000000000 +0000 +++ /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/dataset_metadata.yaml 2024-06-04 23:42:37.000000000 +0000 @@ -1,15 +1,10 @@ -friendly_name: org-mozilla-fenix Derived +friendly_name: Org Mozilla Fenix Derived description: |- - Derived tables related to document namespace org-mozilla-fenix, usually populated via queries defined in https://github.com/mozilla/bigquery-etl and managed by Airflow + Derived tables related to document namespace org-mozilla-fenix. dataset_base_acl: derived user_facing: false labels: {} -default_table_workgroup_access: -- role: roles/bigquery.dataViewer - members: - - workgroup:mozilla-confidential workgroup_access: - role: roles/bigquery.dataViewer members: - workgroup:mozilla-confidential -syndication: {} diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1/backfill.yaml /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1/backfill.yaml --- /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1/backfill.yaml 1970-01-01 00:00:00.000000000 +0000 +++ /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1/backfill.yaml 2024-06-04 23:42:37.000000000 +0000 @@ -0,0 +1,8 @@ +2024-06-04: + start_date: 2024-02-01 + end_date: 2024-06-03 + reason: Fix erroneous geckoview_major_version values since Feb 2024 + (https://github.com/mozilla/glam/issues/2843#issuecomment-2148079303) + watchers: + - efilho@mozilla.com + status: Initiate diff -bur --no-dereference --new-file /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1/schema.yaml /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1/schema.yaml --- /tmp/workspace/main-generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1/schema.yaml 1970-01-01 00:00:00.000000000 +0000 +++ /tmp/workspace/generated-sql/sql/moz-fx-data-shared-prod/org_mozilla_fenix_derived/geckoview_version_v1/schema.yaml 2024-06-04 23:42:37.000000000 +0000 @@ -0,0 +1,13 @@ +fields: +- description: null + mode: NULLABLE + name: build_hour + type: DATETIME +- description: null + mode: NULLABLE + name: geckoview_major_version + type: INTEGER +- description: null + mode: NULLABLE + name: n_pings + type: INTEGER ```

Link to full diff