fivetran / dbt_fivetran_log

Data models for Fivetran's internal log connector built using dbt.
https://fivetran.github.io/dbt_fivetran_log/
Apache License 2.0
30 stars 24 forks source link

Bug/redshift json parse #114

Closed fivetran-catfritz closed 6 months ago

fivetran-catfritz commented 6 months ago

PR Overview

This PR will address the following Issue/Feature:

This PR will result in the following new package version:

Please provide the finalized CHANGELOG entry which details the relevant changes included in this PR:

Breaking Changes

  • The following changes are marked as a breaking change out of caution, as a full refresh may be required if you are experiencing issues after the update.
  • For Bigquery and Databricks users, updated the partition_by config to coordinate with the filter used in the incremental logic.
  • For Snowflake users, added a cluster_by config for performance.

Feature Updates

  • Updated incremental logic for fivetran_platform__audit_table so that it looks back 7 days to catch any late arriving records.
  • Updated json parsing logic to prevent run failures when incoming json-like strings are invalid.
  • Added filter to fivetran_platform__connector_status so only necessary log records will be parsed.

Under The Hood

  • Added macros:
    • fivetran_log_json_parse to handle the updated json parsing.
    • fivetran_log_lookback for use in fivetran_platform__audit_table.
  • Updated testing of invalid json strings.

PR Checklist

Basic Validation

Please acknowledge that you have successfully performed the following commands locally:

Before marking this PR as "ready for review" the following have been applied:

Detailed Validation

Please share any and all of your validation steps:

If you had to summarize this PR in an emoji, which would it be?

🔍