ros-infrastructure / buildfarm_deployment

Apache License 2.0
30 stars 39 forks source link

Bionic builds for Arm{64,hf} sometimes fail with a corrupt package #198

Closed clalancette closed 6 years ago

clalancette commented 6 years ago

Starting around April 27th, we've had a number of incidents where Armhf or Arm64 builds for Melodic on Bionic have failed with a "corrupt" humanity-icon-theme package (example). If we look closely at that log, at timestamp 23:05:03, we see the following:

23:05:03 Preparing to unpack .../041-humanity-icon-theme_0.6.15_all.deb ...
23:05:03 Unpacking humanity-icon-theme (0.6.15) ...
23:05:05 dpkg: error processing archive /tmp/apt-dpkg-install-nKKgiP/041-humanity-icon-theme_0.6.15_all.deb (--unpack):
23:05:05  corrupted filesystem tarfile - corrupted package archive

(this then causes cascading failures later on). Note that this failure is random; running the same build again may make this succeed. I spent some time with @nuclearsandwich debugging this on one of the ros buildfarm hosts, and it looks like the package is being properly downloaded (I confirmed by doing an md5sum against the downloaded package and verifying that it matched the upstream one). But it seems like during uncompression it sometimes gets corrupted. I'll also note that when the failure happens it always happens because of this package; I've never seen any others.

Here are some links to additional failures, though this is not an exhaustive list:

nuclearsandwich commented 6 years ago

Some notes from continued investigation. This has proved extremely heisenbuggy.

Ways we've been able to reproduce the issue:

Ways we've been unable to reproduce the issue:

Possible workarounds

nuclearsandwich commented 6 years ago

Docker 18.03 has been deployed. I'm going to monitor for a few days at least before determining for sure whether or not we can stay.