datasig-ac-uk / esig

esig python package
https://esig.readthedocs.io/en/latest/
GNU General Public License v3.0
42 stars 3 forks source link

Sporadic build failures on GitHub Actions #45

Closed rolyp closed 3 years ago

rolyp commented 4 years ago

The build now runs ~20 jobs for various configurations, but one or two jobs often stall and never resume. Sometimes this may be down to bandwidth or download issues, but often it looks like the job has just stalled in the GitHub Actions runner. The workaround is to cancel the build when this happens and run again until all jobs complete.

Basic problem. In recent days, it seems that, with Python 3.6+ on Linux, test_recombine can no longer use 240 dimensions – the process hangs (as opposed to just taking a long time). 240 used to be a small enough number; now 90 reliably fails. 60 works, so that is the current setting. Further investigation is required to see if there is an underlying problem, or whether this is related to (changes in) the resources available to GitHub Actions.

rolyp commented 4 years ago

Not going to worry about these, as the difficulty of getting the system building in the first place is significantly greater than any problems associated with the build occasionally failing for a random reason. GitHub claim some of these are “in progress” issues.

rolyp commented 3 years ago

The Windows failures are (at least) to do with:

CircleCI experiment (Windows):

rolyp commented 3 years ago

Reopened this issue, as the build now functions pretty well, and with 20+ platform targets, these sporadic failures means it usually takes multiple attempts to do a full build.

rolyp commented 3 years ago

Re-closing this issue. The sporadic curl failures ([Errno 14] PYCURL ERROR 18 - “transfer closed with x bytes remaining to read”) remain, but these aren’t too frequent (or easy to fix).