Closed edgao closed 1 year ago
@pcorbel / @charbelrseif since you both upvoted this issue - are you using snowflake or bigquery? https://github.com/airbytehq/airbyte/issues/28372 exists for bigquery, but if you're both on snowflake then I might just focus on that for now
@edgao We are using Snowflake
got it. One more question - are these streams in the same connection? Or are they separated across connections?
(I'm trying to repro this error locally so I can validate that I've actually fixed it)
managed to repro with concurrent syncs, but please let me know if you were seeing this within a single sync (i.e. no other connections running at the same time)!
@edgao I am using the same destination with different namespaces for 1000+ connectors and I often have the issue. However, on my staging instance where I have 10-ish connectors, the issue does not come up because I think I do not have concurrent syncs.
got it. Then I think https://github.com/airbytehq/airbyte/pull/30599 will solve the problem + implements a test to prevent future regressions. Will update here when we release a new version of destination-snowflake with this fix. Thanks for the info!
Thanks @edgao In the meantime, do you know if I can test the new code simply? Like is there a Docker image available for this PR somewhere, and could I update something in the database for Airbyte to use the right tag?
lemme start a prerelease publish! That would be super helpful to verify that it works, thanks for the offer :)
running in https://github.com/airbytehq/airbyte/actions/runs/6263421961; will update here when it's ready for use.
try updating your destination-snowflake version to 3.1.12-dev.05cdd1566b
- that will switch you to the prerelease version.
The fix is working great, thanks!
awesome, thanks for being the guinea pig :) I'll work with the team to get this released for real.
latest destination-snowflake has a regression due to the java cdk refactor including an outdated version of some core code. working on a cdk republish + updated destination-snowflake release.
snowflake's
create stage
query is doing roughlycreate stage airbyte_internal.<stream_name>
. If there are multiple streams with the same name but different namespace, they will clobber each other. We should give the stage name the same name as the raw table, i.e.create stage airbyte_internal.<namespace>_raw__stream_<name>
.Check if bigquery has a similar problem for its GCS files.
Errors look like: