Closed mohitravi123 closed 2 years ago
Hi @mohitravi123, thanks for reaching out and thanks for all these helpful code snippets and information! We've pushed a fix in our service backend to fix this issue which will take roughly 2 weeks to deploy. Will let you know once the fix takes effect.
Thanks @qidewenwhen. Once the fix is deployed, do I have to upgrade my Sagemaker version to the latest one, or will it be backward compatible with the version I'm using (2.86.2)?
Thanks @qidewenwhen. Once the fix is deployed, do I have to upgrade my Sagemaker version to the latest one, or will it be backward compatible with the version I'm using (2.86.2)?
No, you don't need to upgrade the Sagemaker version. The fix should work for v2.86.2.
The fix has been deployed successfully. Please feel free to reach out for any further questions. Thanks
Describe the bug I'm using a
LambdaStep
in the sagemaker pipeline to run an athena query and store results in s3 which will be the input for future steps. The lambda function takes around 7 minutes to run. Even after the lambda function is successful, theLambdaStep
in the sagemaker pipeline does not succeed and ultimately times out exactly at 10 minutes which is the maximum run time for theLambdaStep
. I can verify that the lambda function is successful by looking at cloudwatch logs and the resulting output of the athena query in the s3 bucket.To reproduce Create a lambda function with the following code. To mimic the athena query run time, I have added a sleep function for 7 minutes.
My sagemaker pipeline is defined as follows:
Expected behavior After the lambda function succeeds, I would expect the
lambdaStep
to be successful and move to the next step in the sagemaker pipeline.Screenshots or logs In the Output tab of the
lambdaStep
in sagemaker studio, I can see the following message about lambda timeout.System information A description of your system. Please provide:
Additional context The lambda function configuration is: