Kritis leaks memory

klauern commented 5 years ago

Expected Behavior

It runs at a stable memory footprint

Actual Behavior

It doesn't; memory usage keeps growing over time.

Steps to Reproduce the Problem

Run it for a while

Additional info

I don't know how to better debug the issue, but here's a graph:

Each of these is an individual container on a separate Kubernetes cluster, so they are similar in nature but will have their own footprint.

I'd appreciate any guidance on what information I can provide to help diagnose this. This graph is the only view into the problem that I have, and I'm scratching my head for other ways to get this information from a running cluster.

klauern commented 5 years ago

Note that the one that looks the most stable is our least-used server, so it doesn't get any traffic, hence it doesn't really fluctuate.

ooq commented 5 years ago

Hi @klauern, can you describe the workload a little bit?

vbanthia-zz commented 5 years ago

We also faced this issue. I did some profiling and found that the memory leak happens because grafeasclient connections are not closed after a request is handled, which leaves many zombie open connections.

I created a debug Docker image with netstat installed and observed that new gRPC connections (TCP) are created whenever a new pod is created. These connections are never closed and stay open.

root@kritis-validation-hook-f76fd4c75-z2cvg:/go# netstat -W | wc -l
524

Connections

tcp        0      0 kritis-validation-hook-f76fd4c75-z2cvg:55436 nrt12s23-in-f10.1e100.net:https ESTABLISHED
tcp        0      0 kritis-validation-hook-f76fd4c75-z2cvg:44904 nrt12s13-in-f10.1e100.net:https ESTABLISHED
tcp        0      0 kritis-validation-hook-f76fd4c75-z2cvg:54140 nrt12s15-in-f74.1e100.net:https ESTABLISHED

NOTE: 1e100.net is a Google domain; here it is the remote end of the containeranalysis API connections.

We use a Kritis fork internally and are temporarily using this fix: https://github.com/mercari/kritis/pull/18

Ideally, Kritis should not create the grafeasclient inside the handler. Instead, the client should be initialized once before the request handler is called, so that the same connection can be reused across goroutines (request handlers).

(https://github.com/grafeas/kritis/blob/681e6d3e8a4675d386e580fd643304c5e0263245/pkg/kritis/admission/admission.go#L246)