concourse / hush-house

Concourse k8s-based environment
https://hush-house.pivotal.io
29 stars 23 forks source link

Research Linux OOM killer behavior for cgroups #20

Open cirocosta opened 5 years ago

cirocosta commented 5 years ago

Hey,

We've been seeing some of our workers going away after a certain memory profile gets reached due to the kernel's OOM killer getting started and destroying our container.

It seems like k8s does not act as an intermediary for the OOM killer when it comes to cgroups, making the whole process very ungraceful.

It'd be interesting to understand:

This is very impactful for workloads like strabo that might generate a huge in-memory cache for the files it access.