Open dhiaayachi opened 2 weeks ago
Thanks for reporting this!
It seems like you are encountering performance issues with your Temporal cluster when you have a large number of workers connected. This is a common problem when you are dealing with a high volume of tasks, and it is important to correctly configure your cluster to handle this type of load.
Let's break down the issues you're facing and explore some solutions:
It seems like you have some understanding of how to configure your cluster using Helm and standalone MySQL. However, I need a few more details to give you specific recommendations:
Once I have this information, I can help you find the best configuration for your cluster to handle tens of thousands of workers.
In the meantime, I recommend reviewing the following documentation:
I hope this information is helpful. Please feel free to ask any further questions.
Thank you for reporting this issue. It appears that your setup is experiencing resource constraints due to a large number of workers and limited node resources.
Based on the provided information, the memory, disk I/O, and CPU limitations of your current Kubernetes node are a major bottleneck when scaling to tens of thousands of workers.
Here are some recommendations for configuring your cluster to handle such a large number of workers:
For specific guidance on tuning worker configurations and optimizing cluster setup, refer to the following documentation:
Remember to monitor your cluster resources and metrics closely to understand the performance impact of your configuration changes. Feel free to reach out to Temporal support if you have further questions or require additional assistance in troubleshooting your setup.
Thank you for reporting this issue.
Based on your description, it appears that you are experiencing performance issues when running Temporal with a large number of workers.
Let's go through your setup and see what we can adjust to improve things:
It is important to keep in mind that tuning Temporal for optimal performance requires a combination of these strategies. It's crucial to understand your workload patterns and iterate on your configuration to find the right balance for your environment.
Please let me know if you have any further questions or need additional assistance.
When I tested with more than 200 workers connected, I encountered some issues; my server memory was full,Too high memory usage on matching service and subsequently, I discovered the disk IO was also maxed out. I deployed Temporal using Helm and utilized a standalone MySQL. My k8s node has 2 CPU cores and 2GB of memory. I want to know how should I configure my cluster if there are tens of thousands of workers connected.