Open bhattbhuwan13 opened 1 year ago
Do you have a solution for this? I am facing the same problem.
@KennyTC Nope.
I am facing the same issue... @KennyTC @bhattbhuwan13 Have you fixed this?
I faced the same issue with the same error, and the error message does not seem to be meaningful. In my case, requirements.txt pinned library versions that weren't compatible with the Python version I had chosen for the container image. I realized this after looking at the beginning of the CloudWatch log for that particular deploy execution. Once I fixed the requirements, I was able to deploy my PyTorchModel and get its endpoint created and running.
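For anyone who wants to read the start of that CloudWatch log from a script rather than the console, here is a minimal sketch. It assumes the endpoint's logs are in the `/aws/sagemaker/Endpoints/<endpoint-name>` log group and that the most recently active stream is the one for the failed deploy; the function name and the `my-endpoint` name in the usage comment are hypothetical.

```python
def first_endpoint_log_lines(logs_client, endpoint_name, limit=50):
    """Return the earliest messages from the endpoint's most recently active log stream.

    logs_client is a boto3 CloudWatch Logs client, e.g. boto3.client("logs").
    """
    # Assumed log group naming for SageMaker endpoints.
    group = f"/aws/sagemaker/Endpoints/{endpoint_name}"

    # Pick the stream with the latest activity (assumption: that is the failed deploy).
    streams = logs_client.describe_log_streams(
        logGroupName=group,
        orderBy="LastEventTime",
        descending=True,
        limit=1,
    )["logStreams"]
    if not streams:
        return []

    # startFromHead=True reads from the beginning, where install errors tend to show up.
    events = logs_client.get_log_events(
        logGroupName=group,
        logStreamName=streams[0]["logStreamName"],
        startFromHead=True,
        limit=limit,
    )["events"]
    return [event["message"] for event in events]


# Usage (needs boto3 and AWS credentials; "my-endpoint" is a placeholder):
# import boto3
# for line in first_endpoint_log_lines(boto3.client("logs"), "my-endpoint"):
#     print(line)
```

The client is passed in rather than created inside the function so the same code works with whatever region or profile you already use.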
I was able to resolve this by ensuring that the PyTorch image version I specified matched my custom requirements.txt and Python version, e.g.:
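from sagemaker.pytorch import PyTorchModel

# fname (model artifact location) and role (execution role) are defined elsewhere
pytorch_model = PyTorchModel(model_data=fname,
                             role=role,
                             entry_point='inference.py',
                             framework_version='2.1.0',
                             py_version='py310')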
requirements.txt:
boto3==1.33.3
botocore==1.33.3
torch==2.0.0
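A quick pre-deploy check along these lines can catch a mismatch before the endpoint fails. This is only a sketch: the helper names are hypothetical, it only understands `==` pins, and treating "matched" as "same major.minor version" is my assumption, not something the SDK enforces.

```python
import re


def pinned_versions(requirements_text):
    """Return {package_name: version} for lines pinned with '=='."""
    pins = {}
    for line in requirements_text.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = line.split("#", 1)[0].strip()
        match = re.match(r"^([A-Za-z0-9_.\-]+)\s*==\s*([^\s;]+)", line)
        if match:
            pins[match.group(1).lower()] = match.group(2)
    return pins


def torch_pin_matches(requirements_text, framework_version):
    """True if torch is not pinned, or its major.minor equals framework_version's."""
    pin = pinned_versions(requirements_text).get("torch")
    if pin is None:
        return True
    return pin.split(".")[:2] == framework_version.split(".")[:2]


# Example: compare a requirements file against the framework_version you pass
# to PyTorchModel.
sample = "boto3==1.33.3\ntorch==2.1.0\n"
print(torch_pin_matches(sample, "2.1.0"))
```

Run it against the contents of your own requirements.txt and the `framework_version` you plan to deploy with.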
Discussed in https://github.com/aws/sagemaker-python-sdk/discussions/3638