dotnet / dnceng

.NET Engineering Services

The hosted runner encountered an error while running your job. (Error Type: Disconnect). #1919

Open lewing opened 9 months ago

lewing commented 9 months ago

Build

https://dev.azure.com/dnceng-public/cbb18261-c48f-4abb-8651-8cdcb5474649/_build/results?buildId=545844

Build leg reported

Build / osx-x64 Release AllSubsets_Mono_LLVMJIT

Pull Request

https://github.com/dotnet/runtime.git/pull/97694

Known issue core information

Fill out the known issue JSON section by following the step by step documentation on how to create a known issue

 {
    "ErrorMessage" : "The hosted runner encountered an error while running your job. (Error Type: Disconnect).",
    "BuildRetry": false,
    "ErrorPattern": "",
    "ExcludeConsoleLog": false
 }

@dotnet/dnceng

Release Note Category

Additional information about the issue reported

No response

Report

| Build | Definition | Step Name | Console log | Pull Request |
| --- | --- | --- | --- | --- |
| 862104 | dotnet/runtime | osx-x64 release Installer_Build_And_Test | | dotnet/runtime#108696 |
| 2575769 | dotnet-installer | Darwin_Build_Release_x64 | | |
| 861579 | dotnet/runtime | osx-arm64 Release NativeAOT_Libraries | | dotnet/runtime#109564 |
| 858943 | dotnet/runtime | tvos-arm64 Release AllSubsets_NativeAOT | | dotnet/runtime#109470 |
| 856718 | dotnet/sdk | AoT: macOS (x64) | | dotnet/sdk#44517 |
| 854434 | dotnet/runtime | osx-x64 release Installer_Build_And_Test | | dotnet/runtime#108364 |
| 2567386 | dotnet-installer | Darwin_Build_Release_arm64 | | |
| 850868 | dotnet/sdk | TestBuild: macOS (x64) | | dotnet/sdk#44385 |
| 848462 | dotnet/runtime | osx-x64 release Installer_Build_And_Test | | dotnet/runtime#102704 |
| 847582 | dotnet/sdk | TestBuild: macOS (x64) | | dotnet/sdk#44203 |
| 2563273 | dotnet-sdk | Official: macOS (arm64) | | |
| 846016 | dotnet/runtime | coreclr Pri0 Runtime Tests Run osx x64 checked | | dotnet/runtime#108962 |
| 845761 | dotnet/installer | Darwin_Build_Release_x64 | | |
| 845094 | dotnet/sdk | TestBuild: macOS (x64) | | dotnet/sdk#44239 |
| 2558508 | dotnet-sdk | Official: macOS (x64) | | |
| 840149 | dotnet/runtime | osx-x64 release Installer_Build_And_Test | | dotnet/runtime#108770 |
| 839258 | dotnet/runtime | osx-x64 release Installer_Build_And_Test | | dotnet/runtime#108687 |
| 838323 | dotnet/sdk | TestBuild: macOS (x64) | | dotnet/sdk#44063 |
| 832954 | dotnet/sdk | TestBuild: macOS (x64) | | dotnet/sdk#43993 |

Summary

| 24-Hour Hit Count | 7-Day Hit Count | 1-Month Count |
| --- | --- | --- |
| 3 | 5 | 19 |

Known issue validation

Build: https://dev.azure.com/dnceng-public/public/_build/results?buildId=545844

Error message validated: The hosted runner encountered an error while running your job. (Error Type: Disconnect).

Result validation: ✅ Known issue matched with the provided build.

Validation performed at: 2/5/2024 8:24:46 PM UTC

dougbu commented 9 months ago

This looks a lot like a low-level, transient issue in AzDO. They provide the macOS-12 images and, somehow, something on their side decided to deprovision the agent the job started on. Have you seen this more than once❔

lewing commented 9 months ago

Yes, in https://github.com/dotnet/runtime/pull/97766:

https://dev.azure.com/dnceng-public/public/_build/results?buildId=547619&view=logs&jobId=7f49df26-8126-5de3-bf2f-6ac6bde01830

garath commented 9 months ago

Are you trying to use the "known error" feature here to find other occurrences? There is no error message or pattern listed in the configuration blob of this issue, so it won't match anything.
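To illustrate why an empty configuration blob cannot match anything, here is a minimal sketch of a matcher. It assumes that `ErrorMessage` is treated as a plain substring and `ErrorPattern` as a regular expression; the function name `matches_known_issue` and the exact matching rules are hypothetical and are not taken from the dnceng implementation.

```python
import json
import re


def matches_known_issue(issue: dict, log_text: str) -> bool:
    """Return True if log_text matches the issue's message or pattern.

    Assumed semantics (not confirmed by the thread): ErrorMessage is a
    substring test, ErrorPattern is a regex search.
    """
    message = issue.get("ErrorMessage") or ""
    pattern = issue.get("ErrorPattern") or ""
    if not message and not pattern:
        # Nothing to search for, so the issue can never match a build.
        return False
    if message and message in log_text:
        return True
    return bool(pattern) and re.search(pattern, log_text) is not None


# The blob as originally filed: both fields empty.
ORIGINAL_BLOB = json.loads(
    '{"ErrorMessage": "", "BuildRetry": false, '
    '"ErrorPattern": "", "ExcludeConsoleLog": false}'
)

# The blob after the error message was filled in.
UPDATED_BLOB = dict(
    ORIGINAL_BLOB,
    ErrorMessage=(
        "The hosted runner encountered an error while running your job. "
        "(Error Type: Disconnect)."
    ),
)

LOG_LINE = (
    "##[error]The hosted runner encountered an error while running "
    "your job. (Error Type: Disconnect)."
)
```

Under these assumptions, `matches_known_issue(ORIGINAL_BLOB, LOG_LINE)` is false and `matches_known_issue(UPDATED_BLOB, LOG_LINE)` is true.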

dougbu commented 9 months ago

Not sure, but we may not have any telemetry containing the interesting lines, because they occur between build steps:

##[error]The hosted runner encountered an error while running your job. (Error Type: Disconnect).
,##[warning]Received request to deprovision: The request was cancelled by the remote provider.

and, in place of a step's log

Nothing to show. Final logs are missing. This can happen when the job is cancelled or times out.

@garath, @AlitzelMendez, @missymessa, do we preserve any logs other than those for individual build steps (and Helix work items) for known issue matching❔

garath commented 9 months ago

> do we preserve any logs other than those for individual build steps (and Helix work items) for known issue matching❔

No. If AzDO isn't showing the log, then it's likely the data never got captured into AzDO's database. Everything else is driven from that.
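If the job-level lines quoted earlier in this thread are available as text, they could be classified with something like the sketch below. The `##[error]` and `##[warning]` prefixes and the optional leading comma are copied from those quoted lines; the names `JobIssue`, `parse_issue_line`, and `is_deprovision_disconnect` are hypothetical, and nothing here reflects how dnceng tooling actually reads this data.

```python
import re
from typing import Iterable, NamedTuple, Optional


class JobIssue(NamedTuple):
    severity: str  # "error" or "warning"
    message: str


# Matches lines such as "##[error]..." or ",##[warning]...".
_ISSUE_LINE = re.compile(r"^,?\s*##\[(error|warning)\](.*)$")


def parse_issue_line(line: str) -> Optional[JobIssue]:
    """Parse one job-level issue line, or return None if it is not one."""
    match = _ISSUE_LINE.match(line.strip())
    if match is None:
        return None
    return JobIssue(match.group(1), match.group(2).strip())


def is_deprovision_disconnect(lines: Iterable[str]) -> bool:
    """True when both the Disconnect error and the deprovision warning appear."""
    issues = [issue for issue in map(parse_issue_line, lines) if issue]
    has_disconnect = any(
        i.severity == "error" and "Error Type: Disconnect" in i.message
        for i in issues
    )
    has_deprovision = any(
        i.severity == "warning" and "request to deprovision" in i.message
        for i in issues
    )
    return has_disconnect and has_deprovision
```

Requiring both lines is a design choice for this sketch: it separates an agent that was deprovisioned from other kinds of disconnect, at the cost of missing builds where only one of the two lines was recorded.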

lewing commented 9 months ago

I didn't expect matching on that error message to work, so I didn't add it to the known issue JSON at first. It does appear to match on it, though, so I updated the issue and now some results are showing up.