Open dhiaayachi opened 2 weeks ago
This GitHub issue describes a potential memory leak in the Temporal Server, causing a gradual increase in CPU and memory usage, ultimately leading to outages.
temporal server start-dev --port 7233 --ui-port 8233 --metrics-port 9233
By following these steps, you can hopefully pinpoint the root cause of the memory leak and take corrective actions to ensure a stable and reliable Temporal Service.
Thanks for reporting this issue. Could you please tell me what version of Temporal you are using? Is it Temporal Cloud or self-hosted? Also, could you provide the details of your database setup? This will help us diagnose the issue further.
Thank you for reporting this issue.
Could you please confirm if the Frontend service was started with the --ui-port
flag?
This flag is necessary to enable the UI, which provides insights into the system.
Also, please let me know if you've attempted any troubleshooting steps or tried running the Temporal Server in a different environment?
The Troubleshooting section in our documentation might have some helpful information.
Expected Behavior
There should be no memory leak resulting from objects not being properly garbage collected.
Actual Behavior
Number of objects on Heap keeps growing. This seems to be result in slow increase of cpu & memory usage eventually resulting in outage.
Steps to Reproduce the Problem
temporal server start-dev --port 7233 --ui-port 8233 --metrics-port 9233
Do not start workflows or make any grpc calls, use the sdks or the web ui. aka keep the usage to a minimum.
Periodically check pprof & metrics. This shows that the goroutinue counts, objects on heap (aka memory allocations for objects) & memory allocations keeps growing perpetually.