Insufficient memory of docker containers on CI

fanyang-mono commented 10 months ago

Build

https://dev.azure.com/dnceng-public/cbb18261-c48f-4abb-8651-8cdcb5474649/_build/results?buildId=351450

Build leg reported

Build / browser-wasm linux Release LibraryTests / Build product

Pull Request

https://github.com/dotnet/runtime/pull/89217

Known issue core information

Fill out the known issue JSON section by following the step by step documentation on how to create a known issue

 {
    "ErrorMessage" : "[error]Exit code 137 returned from process: file name '/usr/bin/docker'",
    "BuildRetry": false,
    "ErrorPattern": "",
    "ExcludeConsoleLog": false
 }

@dotnet/dnceng

Release Note Category

[ ] Feature changes/additions
[ ] Bug fixes
[ ] Internal Infrastructure Improvements
Release Note Description

Additional information about the issue reported

No response

Report

Build	Definition	Step Name	Console log	Pull Request
694253	dotnet/runtime	Build product	Log
690754	dotnet/runtime	Build product	Log
686902	dotnet/runtime	Build product	Log
686411	dotnet/runtime	Build product	Log
683686	dotnet/runtime	Build Tests	Log	dotnet/runtime#102505
682021	dotnet/runtime	LLVM AOT compile CoreCLR tests	Log
681751	dotnet/runtime	LLVM AOT compile CoreCLR tests	Log
680841	dotnet/runtime	Build Tests	Log	dotnet/runtime#102432
679966	dotnet/runtime	LLVM AOT compile CoreCLR tests	Log
679692	dotnet/runtime	LLVM AOT compile CoreCLR tests	Log
679746	dotnet/runtime	Build Tests	Log	dotnet/runtime#102400
679285	dotnet/runtime	Build product	Log
678653	dotnet/runtime	LLVM AOT compile CoreCLR tests	Log
678386	dotnet/runtime	LLVM AOT compile CoreCLR tests	Log
677394	dotnet/runtime	LLVM AOT compile CoreCLR tests	Log
675712	dotnet/runtime	LLVM AOT compile CoreCLR tests	Log
675458	dotnet/runtime	LLVM AOT compile CoreCLR tests	Log
674146	dotnet/runtime	LLVM AOT compile CoreCLR tests	Log
671841	dotnet/runtime	LLVM AOT compile CoreCLR tests	Log
664347	dotnet/runtime	LLVM AOT compile CoreCLR tests	Log

Summary

24-Hour Hit Count	7-Day Hit Count	1-Month Count
0	2	20

Known issue validation

Build: :mag_right: https://dev.azure.com/dnceng-public/public/_build/results?buildId=351450 Error message validated: [error]Exit code 137 returned from process: file name '/usr/bin/docker' Result validation: :white_check_mark: Known issue matched with the provided build. Validation performed at: 7/26/2023 2:43:39 PM UTC

andriipatsula commented 10 months ago

Hello @fanyang-mono, could you please update the "ErrorMessage" : "" by following the step by step documentation on how to create a known issue

fanyang-mono commented 10 months ago

Updated.

missymessa commented 10 months ago

It's likely your process is using too much memory. Check to see when this started and if there were code changes around that time that could have caused this to occur.

https://www.airplane.dev/blog/exit-code-137

missymessa commented 10 months ago

@fanyang-mono, is this an infra issue? It looks like the errors are isolated to Runtime.

fanyang-mono commented 10 months ago

@lewing Could you please confirm that this is a wasm build issue? This is the direct link to the build log https://dev.azure.com/dnceng-public/public/_build/results?buildId=351450&view=logs&j=d4e38924-13a0-58bd-9074-6a4810543e7c&t=102a6595-1420-53fc-8f17-b0a3f4b1242a&l=5722

lewing commented 10 months ago

https://dev.azure.com/dnceng-public/cbb18261-c48f-4abb-8651-8cdcb5474649/_apis/build/builds/352553/logs/541 is definitely not a wasm build issue

lewing commented 10 months ago

exit code 127 typically means the process was sent a sig kill 128 + 9 = 137. Given that this is happening inside docker containers it is likely because they are hitting resource limits

lewing commented 10 months ago

what are the limits on the cloudtest containers?