Closed rariyama closed 1 year ago
I could solve this problem by setting a value which is larger than 10 minutes to this variable. Specifically, I added the parameter to property file and set value larger than 600.
agent.command_executor.type=ecs
agent.command_executor.ecs.name=Example
agent.command_executor.ecs.Example.access_key_id=SampleKey
agent.command_executor.ecs.Example.secret_access_key=SampleSecret
agent.command_executor.ecs.Example.launch_type=FARGATE
agent.command_executor.ecs.Example.region=ap-northeast-1
agent.command_executor.ecs.Example.subnets=SampleSubnet
agent.command_executor.ecs.Example.max_retries=10
agent.command_executor.ecs.temporal_storage.type=s3
agent.command_executor.ecs.temporal_storage.s3.bucket=SampleBucket
agent.command_executor.ecs.temporal_storage.s3.endpoint=s3.amazonaws.com
agent.command_executor.ecs.temporal_storage.s3.credentials.access-key-id=SampleKey
agent.command_executor.ecs.temporal_storage.s3.credentials.secret-access-key=SampleSecret
agent.command_executor.ecs.temporal_storage.s3.direct_upload_expiration=660 # longer than 600
eval.js-engine-type=graal
I think it's better to describe it on the document about ECS Command Executor because this value has an effect on the processes which are possible to continue for 10 minutes or more. If my offer is promising I'll create PR to add the description about it.
I encountered the issue when I executed task which can last for 10 minutes or more on ECS Command Executor. The file named as archive-output.tar.gz should be put on S3 after execution is finished but it actually wasn't. I experienced this issue when executing python and shell operator. Python was failed but shell was succeeded because python checks whether file exists or not.
Log and sample resources like workflow are following(Parts of values are masked).
Log outputted on Cloudwatch.
The python operator log outputted by executing digdag log command
The example workflow.
example.py
The example digdag run command
digdag run ./Example.dig +Example+Py --session-time $(date +'%Y-%m-%d')
The settings on property file.
I tried to search for the reason but I coudn't find. Please let me know the reason or the effective solution if you know.