Closed by hieuhc 5 years ago
Are you positive that task-parent exited with exit code 0?
I just tried a sample recipe with task dependencies, and everything appears to be working correctly. Would you be able to try that recipe and see if you can repro task dependencies getting stuck?
Also, which Azure region are you running your jobs in? Could you try a different region temporarily?
Yes, task-parent exited with code 0, or at least that is what the Azure Portal shows. I haven't tried to ssh to that node to check. In task-parent I use sys.exit(0) in Python 2.7 to terminate the application.
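For reference, a minimal sketch of how the exit code produced by sys.exit(0) could be checked locally, without involving Azure Batch at all. The helper name exit_code_of is hypothetical, and the script body is a stand-in for the real task-parent application, which is not shown in this thread.

```python
import subprocess
import sys


def exit_code_of(source):
    """Run `source` in a fresh interpreter and return the process exit code.

    `exit_code_of` is a hypothetical helper for illustration only.
    """
    return subprocess.call([sys.executable, "-c", source])


# Stand-in for the real task-parent script: it terminates with sys.exit(0).
code = exit_code_of("import sys; sys.exit(0)")
print("exit code: %d" % code)
```

If this reports a nonzero value for the real script, the parent task itself is the place to look. If it reports 0, the problem is more likely on the scheduling side.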
The jobs have been running in Azure region South Central US. I will try to create an Azure Batch account in West Europe to see how it behaves.
Thanks, it looks like it may be an Azure Batch regional issue. We're tracking this internally. In the meantime, if you can confirm that it does not repro in a different region, that would be great.
I haven't had a chance to create another Azure Batch account, but the problem now seems to have gone away: dependent tasks are triggered as expected. Maybe that regional issue was fixed. Thank you for your support.
There was a delay in task dependency processing in South Central US and the issue has since been mitigated. Thanks for your patience.
Problem Description
I submitted a job containing a set of parent tasks and a set of dependent tasks. After a while, all parent tasks completed with exit code 0, but the dependent tasks remained in Active status and were never triggered.
Batch Shipyard Version
Pool was created with 3.5.2-cli; jobs were submitted with both 3.5.2-cli and 3.6.1-cli. I also tried force_enable_task_dependencies: true.
Steps to Reproduce
In my case, I reduced the job to a simple job file containing 2 tasks with dependencies.
Expected Results
Dependent tasks should be triggered after their parent tasks enter Completed status with exit code 0.
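The expected behaviour can be sketched as a toy model in Python. This is only an illustration of the rule stated above, not how Azure Batch or Batch Shipyard implements it. The function name ready_dependents, the dictionary keys, and the task id task-child are all assumptions made for the example.

```python
def ready_dependents(tasks):
    """Return ids of active tasks whose parents all completed with exit code 0.

    `tasks` maps a task id to a dict with hypothetical keys:
    'state', 'exit_code' and 'depends_on' (a list of parent task ids).
    """
    ready = []
    for task_id, task in sorted(tasks.items()):
        if task["state"] != "active":
            continue  # only tasks still waiting can be triggered
        parents = [tasks[parent_id] for parent_id in task["depends_on"]]
        if all(p["state"] == "completed" and p["exit_code"] == 0 for p in parents):
            ready.append(task_id)
    return ready


# Two tasks, mirroring the reduced job: one parent, one dependent (hypothetical ids).
tasks = {
    "task-parent": {"state": "completed", "exit_code": 0, "depends_on": []},
    "task-child": {"state": "active", "exit_code": None, "depends_on": ["task-parent"]},
}
print(ready_dependents(tasks))
```

Under this model, task-child is eligible to run as soon as task-parent is completed with exit code 0, which is the state the job was in when the dependent tasks stayed Active.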
Sample job file