Open kzvezdarov opened 2 weeks ago
@airbytehq/platform-move can someone take a look into this? Is this maybe a migration problem not updating the column in the database?
kzvezdarov I ran into the same issue when running airbyte through docker compose on version 0.63.3. I managed to get around it by changing the input to the Worker Container. By default, the value for INTERNAL_API_HOST was airbyte-server:8001, but it works when it's http://airbyte-server:8001. So essentially I added "http://" before the value for INTERNAL_API_HOST as input to the Worker Container (or deployment in k8s) and it solved the issue. It took me 3 hours of trial and error to get there. Hope it helps
kzvezdarov I ran into the same issue when running airbyte through docker compose on version 0.63.3. I managed to get around it by changing the input to the Worker Container. By default, the value for INTERNAL_API_HOST was airbyte-server:8001, but it works when it's http://airbyte-server:8001. So essentially I added "http://" before the value for INTERNAL_API_HOST as input to the Worker Container (or deployment in k8s) and it solved the issue. It took me 3 hours of trial and error to get there. Hope it helps
Thanks for the suggestion, unfortunately I've already tried that with no effect. It feels like it might be related to some database state (it's a long run deployment, initially created on 0.44.x), as it persists into 0.63.5 regardless of configuration changes.
Helm Chart Version
0.233.2
What step the error happened?
During the Sync
Relevant information
I'm in the process of upgrading an Airbyte deployment on GKE Autopilot from
v0.50.45
tov0.63.2
. Following the migration guide here: https://docs.airbyte.com/deploying-airbyte/on-kubernetes-via-helm#migration-steps, the deployment was upgraded and services started successfully, but syncs fail to run with the following errors:Caused by: java.lang.NullPointerException: connectionId is marked non-null but is null
This seems to be similar to https://github.com/airbytehq/airbyte/issues/38345 Full error log:Caused by: io.temporal.failure.ApplicationFailure: message='baseUrl is invalid.', type='java.lang.IllegalStateException', nonRetryable=false
This seems to be the same or similar issue to https://github.com/airbytehq/airbyte/issues/38854, but none of the workaround methods listed in that thread have worked so far. Full error log:"Caused by: io.temporal.failure.ApplicationFailure: message='Cannot invoke "java.lang.Long.longValue()" because the return value of "io.airbyte.workers.temporal.scheduling.activities.JobCreationAndStatusUpdateActivity$AttemptNumberFailureInput.getJobId()" is null', type='java.lang.NullPointerException', nonRetryable=false"
This seems distinct, though as seen in the log from above it's visible in thebaseUrl
error stack as well.Deleting all Airbyte deployments and volumes fully and reapplying the chart seems to allow a few syncs to make progress for a few minutes, only to start failing in the same manner. I have not tried resetting the airbyte database, but that's not really an option in any case.
This issue persists on 0.63.1 and 0.60.1; unfortunately upgrading the cluster to 0.63.2 has made rolling back to 0.50.45 impossible, as various SQL queries fail due to missing columns/other backward compatibility issues.
Relevant log output
No response