argoproj / argo-workflows

Workflow Engine for Kubernetes
https://argo-workflows.readthedocs.io/
Apache License 2.0

Hang on "Workflow processing has been postponed due to max parallelism limit" #11808

Open Guillermogsjc opened 10 months ago

Guillermogsjc commented 10 months ago

Pre-requisites

What happened/what you expected to happen?

The onExit pod hangs for 20 hours with the message "Workflow processing has been postponed due to max parallelism limit", even though only 2 workflows are active, each with few pods, and `namespaceParallelism` is set to 1000 in the workflow-controller ConfigMap.
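
For anyone trying to confirm the same setup, here is a minimal sketch of how the parallelism limits in the controller ConfigMap could be checked. `show_parallelism` is a hypothetical helper, not part of Argo, and the ConfigMap name `workflow-controller-configmap` is an assumption based on the default install. The sketch also assumes the global `parallelism` key may apply in addition to `namespaceParallelism`, so it prints both.

```shell
# Hypothetical helper (not part of Argo): print the parallelism-related keys
# found in a ConfigMap YAML read from stdin.
show_parallelism() {
  grep -E '^[[:space:]]*(parallelism|namespaceParallelism):'
}

# Usage against a live cluster. Assumes the default ConfigMap name and the
# "argo" namespace used in this report:
#   kubectl get configmap workflow-controller-configmap -n argo -o yaml | show_parallelism
```

If only `namespaceParallelism` shows up, the global limit is presumably unset. If both appear, the lower of the two may be the one being hit.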

Version

v3.4.11

Paste a small workflow that reproduces the issue. We must be able to run the workflow; don't enter a workflow that uses private images.

-

Logs from the workflow controller

Over 20 hours of the same message:
"Workflow processing has been postponed due to max parallelism limit"

Logs from your workflow's wait container

`kubectl logs -n argo -c wait -l workflows.argoproj.io/workflow=${workflow},workflow.argoproj.io/phase!=Succeeded`

gives `No resources found in argo namespace`.

`kubectl logs -n argo -c wait -l workflows.argoproj.io/workflow=${workflow} --context pro`

gives `No resources found in argo namespace`.

`kubectl get wf -o name -n argo --context pro | rg ${workflow}`

gives 

`kbrasil-4-1694509200            Running     20h`

workflow-controller log:

logs-from-workflow-controller-in-workflow-controller-54949c4496-2nrnn.log

tczhao commented 3 months ago

Hi @Guillermogsjc, were you able to make any progress on this issue, or is it still recurring?

Guillermogsjc commented 3 months ago

I have not managed to find this issue again.

Closing

vadasambar commented 2 weeks ago

Facing a similar issue in v3.5.4. /reopen