JuliaCI / julia-buildkite

Buildkite configuration files for Base Julia CI
8 stars 15 forks source link

Feature request: in the Windows `autodump.jl` script, create core dump files if a process crashes #37

Open DilumAluthge opened 3 years ago

DilumAluthge commented 3 years ago

@staticfloat If I understand correctly, the Windows autodump.jl script currently only creates the core dump files if the test job exceeds the timeout (two hours).

Would it be possible for us to modify the autodump.jl script so that it creates the core dump files if either of the following occur:

  1. If the test job exceeds the timeout.
  2. If one or more of the Distributed worker processes crashes while running the tests.

Motivation

I am trying to help the SuiteSparse.jl team debug https://github.com/JuliaLang/SuiteSparse.jl/issues/43, in which we are seeing nondeterministic failures in the tester_win64 Buildbot job. In that issue, what we see is that the one of the Distributed worker process (the worker process is running the SuiteSparse test set) crashes. I figured that it would be helpful if I could provide the core dump files associated with the crash. However, the overall test job does not exceed the timeout, so the autodump.jl does not currently create any core dump files.

DilumAluthge commented 3 years ago

Bump @staticfloat

Also CC @Keno and @vtjnash