Open petermeansrock opened 1 year ago
Has there been any update on this? This hangup is significantly impacting the runtime of some CDK stack deployment workflows we're building
Same, this issue means we need to add custom handling when deleting sagemaker endpoints via CloudFormation or deal with each delete attempt taking ~15min to timeout.
This issue is severely slowing down our ability to delete sagemaker endpoints during blue green deployments.
Name of the resource
AWS::SageMaker::Endpoint
Resource name
No response
Description
When SageMaker provisions EC2 instances to deploy a customer's
AWS::SageMaker::Endpoint
resource for VPC-connected models, SageMaker creates Elastic Network Interfaces (ENIs) in the customer's account outside of the associated CloudFormation stack. When deleting a stack, CloudFormation will successfully delete the endpoint resource followed by the model before failing to delete the associated security group(s) and subnet(s). Unfortunately, as there are ENIs associated with these networking resources, stack deletion will fail after 15 minutes with errors like:Just as CloudFormation waits for Lambda-created ENIs to be cleaned up on function deletion, shouldn't CloudFormation do the same with SageMaker?
Other Details
No response