linagora / james-project

Mirror of Apache James Project
Apache License 2.0
70 stars 63 forks source link

Noticed a new OOM in prod on JMAP pods #5116

Closed chibenwa closed 5 months ago

chibenwa commented 6 months ago

The Cassandra liveness check was enough to shutdown and reboot the pod.

No idea what caused the OOM - a log audit of nearby events did not lead to anything convincing...

I will:

chibenwa commented 6 months ago

Over 3 days without OOM.

GC logs show a healthy app.

Heap analysis shows nothing.

I did review the log activity on that time period, which was moderate to low.

My theory is still a rogue request alone pulling alone too much resources and crashing the POD.

I hope https://github.com/apache/james-project/pull/2096 to be a good mitigation...

chibenwa commented 5 months ago

No longer happening...

Closing for now.