dhiaayachi / temporal

Temporal service
https://docs.temporal.io
MIT License
0 stars 0 forks source link

Issue with MariaDB 10.6.9. (Galera cluster) #407

Open dhiaayachi opened 2 months ago

dhiaayachi commented 2 months ago

Expected Behavior

We are using Temporal 1.20 with MariaDB 10.2.30 and now we moved to new MariaDB 10.6.9 running in clustered mode (using Galera https://galeracluster.com/)

Actual Behavior

Testing with new Version we observe issues on some workflows, workflow stays in status "Running", activity is scheduled but nothing happens.

image

Steps to Reproduce the Problem

  1. use MariaDB 10.6.9
  2. launch e.g. 100 workflows
  3. some workflows stay in status RUNNING forever

Specifications

dhiaayachi commented 1 month ago

Thanks for reporting this issue. We've seen similar issues with MariaDB in clustered mode (using Galera).

There are a few potential causes for this:

If the issue persists, please provide more details about your setup:

Let me know if you have any further questions.

dhiaayachi commented 1 month ago

Thank you for reporting the issue!

We are aware of an issue with Temporal 1.20 and MariaDB 10.6.9 in clustered mode (Galera) that can cause workflows to become stuck in "Running" status.

Here are a couple of things you can try:

  1. Increase the Workflow Task Timeout: This will give your workers more time to process the Workflow Task. You can try setting the timeout to 60 seconds or more and see if it resolves the issue.
  2. Verify the MariaDB cluster configuration: Ensure that the Galera cluster is properly configured and that all nodes are healthy. If there are any issues with the cluster, it can impact Temporal's ability to communicate with MariaDB.

Please also share the following details to help us further troubleshoot the issue:

We are working on a fix for this issue and will provide an update as soon as it is available.