dhiaayachi / temporal

Temporal service
https://docs.temporal.io
MIT License

tdbg diagnose command for stuck workflow #123

Open dhiaayachi opened 2 months ago

dhiaayachi commented 2 months ago

We need a tdbg command, tdbg workflow diagnose, that dumps all relevant information about a stuck workflow and performs some basic analysis.

Information includes:
1) Mutable state
2) Workflow history
3) Shard info (ack level and read level for all queues)
4) Task queue info (including all partitions, backlog size, etc.)
5) Namespace info

Diagnose part:
1) Check whether a workflow task should have been scheduled or started but was not.
2) Check whether any timer should already have fired but did not.
3) Check whether a worker is polling the right task queue.
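To make the three checks concrete, here is a minimal sketch of the analysis step. It is not the proposed tdbg implementation and uses no Temporal API. The Snapshot type, its field names, and the timer_grace and poller_window thresholds are all assumptions made for illustration. A real command would populate the equivalent fields from mutable state, history, and task queue info.

```python
from dataclasses import dataclass, field
from datetime import datetime, timedelta, timezone
from typing import List, Optional


@dataclass
class Snapshot:
    """Hypothetical, simplified view of one workflow. Not a Temporal type."""
    has_unprocessed_events: bool   # e.g. a signal or activity result awaits a workflow task
    workflow_task_scheduled: bool  # a workflow task is currently scheduled
    workflow_task_started: bool    # a workflow task is currently started
    timer_fire_times: List[datetime] = field(default_factory=list)  # pending timers
    last_poller_seen: Optional[datetime] = None  # latest poll seen on the task queue


def diagnose(snap: Snapshot, now: datetime,
             timer_grace: timedelta = timedelta(seconds=30),     # assumed threshold
             poller_window: timedelta = timedelta(minutes=5)):   # assumed threshold
    """Return a list of human-readable findings; an empty list means no issue found."""
    findings = []

    # Check 1: work is waiting, but no workflow task is scheduled or started.
    if snap.has_unprocessed_events and not (
        snap.workflow_task_scheduled or snap.workflow_task_started
    ):
        findings.append("workflow task expected but neither scheduled nor started")

    # Check 2: a pending timer is past its fire time by more than the grace period.
    overdue = [t for t in snap.timer_fire_times if now - t > timer_grace]
    if overdue:
        findings.append(f"{len(overdue)} timer(s) overdue")

    # Check 3: no worker has polled the task queue recently.
    if snap.last_poller_seen is None or now - snap.last_poller_seen > poller_window:
        findings.append("no recent worker poll on the task queue")

    return findings


# Example: a workflow with pending work, an overdue timer, and no pollers.
now = datetime(2024, 1, 1, 12, 0, tzinfo=timezone.utc)
stuck = Snapshot(True, False, False, [now - timedelta(minutes=10)], None)
for finding in diagnose(stuck, now):
    print(finding)
```

Keeping the checks as a pure function over a snapshot means the same logic can run against a live cluster or against a previously dumped bundle.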

dhiaayachi commented 1 month ago

Thank you for your feature request! We understand the need for a comprehensive diagnostic tool within Temporal.

Currently, there isn't a single tdbg command that fulfills all these requirements. However, you can combine existing tools and techniques to achieve similar results.

To gather the information you need:

For the "diagnose" part, you can use the following:

We encourage you to join the Temporal community and submit a feature request for a dedicated tdbg workflow diagnose command. This will help us prioritize development efforts to improve the debugging experience.

Let me know if you have any other questions!

dhiaayachi commented 1 month ago

Thank you for this feature request! It sounds like you are looking for a more comprehensive debugging tool within Temporal. Currently, Temporal doesn't have a single command like tdbg workflow diagnose to perform all the actions you've outlined.

However, you can achieve most of this functionality by combining existing tools and approaches:

We appreciate your feedback and will consider it for future enhancements to Temporal's debugging capabilities.

dhiaayachi commented 1 month ago

Thank you for the feature request!

Currently, Temporal does not have a tdbg workflow diagnose command. However, you can use the following commands to achieve similar results:

For more information on using these commands, please refer to the Temporal CLI documentation: https://docs.temporal.io/docs/temporal-cli/

dhiaayachi commented 1 month ago

Thank you for the feature request! We appreciate you taking the time to share this.

While we don't currently have a tdbg workflow diagnose command, you can get the information you need by using a combination of existing tools:

For the diagnose part of your request, we recommend:

We will consider adding a tdbg workflow diagnose command in the future.

dhiaayachi commented 1 month ago

Thank you for this feature request.

While we don't have a tdbg workflow diagnose command at this time, you can achieve some of the same functionality using existing tools:

To check for potential issues with task scheduling, timers, and worker polling, you can use the following:

We appreciate your feedback, and we'll consider adding a tdbg workflow diagnose command in the future.