Closed CraigWilson-ZOE closed 2 years ago
hey @CraigWilson-ZOE -- we've added some incremental + partitioning logic to the audit table model in the feature/audit-incrementality
working branch. would you mind testing the branch out to see how the runtime is affected? the first run will probably be a full-refresh and won't make a difference, but hopefully we see a big difference with the incremental runs
# packages.yml
- git: https://github.com/fivetran/dbt_fivetran_log.git
revision: feature/audit-incrementality
Hi Jamie,
Sorry for the delay, just back from the holidays.
I will try this out and get back to you, thanks.
no worries -- if you have a chance to test it out soon, we were aiming to release the fix before our sprint ends this week 🙂
Hey Jamie,
Just managed to try this, we had a few issues upgrading to v1.0.1.
The package ran OK and I can see a smaller number of records processed for the audit_table model.
Thanks
Are you a current Fivetran customer? Craig Wilson, ZOE, Data Engineer
Describe the bug We are monitoring how long each model in our dbt pipeline takes to process and the fivetran_log__audit_table model is one of the longest running that we have. The average execution time is 490 seconds. Looking at the code I believe this model would benefit from being partitioned and processing only the latest day, rather than all data.
Steps to reproduce
Expected behavior I would expect the time of execution to be more constant and not be as high as it currently is
Project variables configuration only copying configuration for the relevant section due to security.
Package Version
Warehouse
Additional context N/A
Screenshots Attached higher up
Please indicate the level of urgency This isn't super urgent but it is taking more and more time, and is impacting cost as we are processing more and more rows each day.
Are you interested in contributing to this package?