We are using 8.23.0 version of airflow provider package and set deferrable to True when using SMProcessingJob.
We set define the ProcessingJobName with something like "project-dag_name-date-uuid", where uuid is to deduplicate job name upon retry. However, when the execution gets back from deferred state, the original uuid is not preserved and there would be a new uuid so the operator cannot find a job with the new job name and it would fail.
Thanks for opening your first issue here! Be sure to follow the issue template! If you are willing to raise PR to address this issue please do so, no need to wait for approval.
Apache Airflow Provider(s)
amazon
Versions of Apache Airflow Providers
We are using 8.23.0 version of airflow provider package and set deferrable to True when using SMProcessingJob.
We set define the ProcessingJobName with something like "project-dag_name-date-uuid", where uuid is to deduplicate job name upon retry. However, when the execution gets back from deferred state, the original uuid is not preserved and there would be a new uuid so the operator cannot find a job with the new job name and it would fail.
There is a closed related issue (https://github.com/apache/airflow/issues/39503) but the fix only applies for Transform jobs.
Apache Airflow version
2.8.1
Operating System
Amazon Linux AMI
Deployment
Amazon (AWS) MWAA
Deployment details
No response
What happened
No response
What you think should happen instead
No response
How to reproduce
For a given SMProcessingJob 1) Set "ProcessingJobName": f"{name}-{str(uuid4())[:8]}". 2) Set deferrable = True 3) The run the DAG.
Anything else
No response
Are you willing to submit PR?
Code of Conduct