Open njuguna-n opened 1 month ago
@witash what do you think of this approach as a way of handling large initial syncs? Running small incremental batches seems to be working well from my local testing so far. I will clean up the PR and add some tests if you agree this is a good approach.
@witash what do you think of this approach as a way of handling large initial syncs? Running small incremental batches seems to be working well from my local testing so far. I will clean up the PR and add some tests if you agree this is a good approach.
ok, yea we can try it, it will be interesting to see how well it does with large databases
@dianabarsan yes that was my concern. I have not tested that yet but I will and add specific test case for that.
@dianabarsan @witash please review. I have added a comment here summarizing the tests I did to this approach.
@dianabarsan I addressed your comments. Please have another look.
Description
Add the ability to run dbt in batches to avoid scenarios where large table updates result in very large temporary tables that crash Postgres.
This PR depends on this corresponding PR in the CHT Pipeline repository.
156
Code review checklist
License
The software is provided under AGPL-3.0. Contributions to this project are accepted under the same license.