Open scottybrisbane opened 1 year ago
@scottybrisbane Thanks for the report!
It might be easier to talk about this in detail in a support ticket - that way we can share more specific information about your installation. Could you open a ticket through the in-app "Get help" functionality, if you haven't done so yet? Thanks!
(happy to update this issue afterwards in case there is something worth sharing more generally)
We are running pganalyze collectors on some of our larger EC2 self-hosted database instances and are seeing the collectors getting consistently oom-killed on Sundays at around midnight UTC. This issue occurs only on our larger instances (for example EC2 instance types
i4i.4xlarge
andi4i.16xlarge
) and not on smaller instances. We also don't see this oom-kill at any other time during the week, aside from midnight UTC on Sundays.We are making a change to increase the memory limit on the systemd service, but wanted to raise this issue as well as it seems unusual given the consistent timing.
Here are some logs from dmesg showing the timing and how much memory is being used by the collector when it is killed:
The pganalyze collector logs don't contain anything unusual around these times.