Closed benipeled closed 1 year ago
Verified with 3 builds (137, 136, 135) - works as expected https://jenkins.scylladb.com/job/scylla-master/job/releng-testing/job/next-machine-image/135
[2023-06-15T06:50:58.487Z] [1;32m==> gce: Pausing 10s before the next provisioner...[0m
...
[2023-06-15T06:51:06.932Z] [1;32m==> gce: Uploading files/ => /home/ubuntu/[0m
[2023-06-15T06:51:07.545Z] [0;32m gce: status: done[0m
Failed on master with SSH connection failure - so the fix helped but we need to increase the time
https://jenkins.scylladb.com/job/scylla-master/job/next-machine-image/478/consoleFull
15:40:13 [1;31mBuild 'gce' errored after 2 minutes 12 seconds: dial tcp 34.78.148.68:22: connect: connection refused[0m
15:40:13
15:40:13 ==> Wait completed after 2 minutes 12 seconds
15:40:13
15:40:13 ==> Some builds didn't complete successfully and had errors:
15:40:13 --> gce: dial tcp 34.78.148.68:22: connect: connection refused
Failed on master with SSH connection failure - so the fix helped but we need to increase the time
https://jenkins.scylladb.com/job/scylla-master/job/next-machine-image/478/consoleFull
15:40:13 �[1;31mBuild 'gce' errored after 2 minutes 12 seconds: dial tcp 34.78.148.68:22: connect: connection refused�[0m 15:40:13 15:40:13 ==> Wait completed after 2 minutes 12 seconds 15:40:13 15:40:13 ==> Some builds didn't complete successfully and had errors: 15:40:13 --> gce: dial tcp 34.78.148.68:22: connect: connection refused
Handled on https://github.com/scylladb/scylla-machine-image/pull/461
I found out that (periodically) the GCE build fails on uploading the files to packer instance
According to packer [0] it might be caused by the reboot action (part of the kernel-install recently added) causes race conditions between the provisions
We should handle it with
pause_before
- I set it to 10s, if it's not gonna help I'll increase it to 20s and add thessh_read_write_timeout
[0] https://developer.hashicorp.com/packer/docs/provisioners/shell#handling-reboots