-
I am trying to replicate https://github.com/awslabs/amazon-sagemaker-examples/blob/master/sagemaker-python-sdk/tensorflow_serving_using_elastic_inference_with_your_own_model/tensorflow_serving_pretrai…
-
**Describe the bug**
SageMaker deployment of a HuggingFace model from the Hub in local mode fails with `KeyError: 'ModelDataUrl'` in `entities.py`.
**To reproduce**
1. Open the example HuggingFac…
-
### Describe the feature
This feature request proposes accelerating the process of adding L1 constructs to AWS CDK for newly supported CloudFormation resources. Currently, it takes 2-3 weeks for thes…
-
## Description
Multi model endpoint deployment in sagemaker through DJL serving is supposed to be supported. Here is the related [AWS page](https://docs.aws.amazon.com/sagemaker/latest/dg/deploy-mode…
-
Lately running into too many Sagemaker issues. Is there any unambiguous documentation on Sagemakers Instances? I could glean the following from different sources:
1. Sagemaker Instances, Sagemaker …
-
### Name of the resource
AWS::SageMaker::Endpoint
### Resource name
_No response_
### Description
When SageMaker provisions EC2 instances to deploy a customer's `AWS::SageMaker::Endpoint` resourc…
-
I am facing problems with deployment on Sagemaker.
Instance: `ml.g5.2xlarge`
With default config this happens
```
Sagemaker deployment failed due to memory error
torch.cuda.OutOfMemoryError: Allo…
-
### Describe the bug
When trying to deploy a new sagemaker domain into aws-gov-west-1 or aws-gov-east-1, I get an Internal Error message. I confirmed that deploying the domain as is in us-east-1 work…
-
Currently cross encoder models are used to rank the search results but the models available need to be hosted on Sagemaker which increases cost significantly. Having an option to disable cross encoder…
-
### Willingness to contribute
Yes. I can contribute this feature independently.
### Proposal Summary
The health endpoint is only available at /health through this route:
```python
@app.route("/h…