Open 5agado opened 1 month ago
Hi! Any luck solving this? I am in the same situation with the pytorch-inference:2.2.0-cpu-py310 image. In serverless mode, I get the same error. I can, however, deploy it to a real-time endpoint.
@rauldiaz this was the fix, but it wasn't properly propagated to the instances.
The recent releases would have solved all the issues if they had updated sagemaker-pytorch-inference; instead, it is still stuck at 2.0.23 :/
See also the ongoing conversation here.
I am using 763104351884.dkr.ecr.ap-northeast-1.amazonaws.com/pytorch-inference:2.2.0-cpu-py310-ubuntu20.04-sagemaker-v1.12 in serverless mode and encountered this error. I also tried an older image (763104351884.dkr.ecr.ap-northeast-1.amazonaws.com/pytorch-inference:2.1.0-cpu-py310-ubuntu20.04-sagemaker-v1.8), but invoking inference failed there as well. Has anyone solved this problem?
Hi there,
I wanted to report that after a night, the issue seemed to resolve itself, and everything was working fine. However, when I updated the endpoint, the same error occurred again. Is this a known issue that tends to happen shortly after deployment?
Thanks!
Describe the bug
Getting a zombie process exception, as already reported for the sagemaker-inference-toolkit.
To reproduce
Using the image
763104351884.dkr.ecr.eu-central-1.amazonaws.com/pytorch-inference:2.2.0-gpu-py310-cu118-ubuntu20.04-sagemaker
with a custom inference script in a batch transform job triggers the error. Even a simple initial time.sleep(60) in the inference.py script is enough to trigger it. A custom requirements.txt file also needs to be provided with the custom inference script.
Here is the full traceback:
System information
763104351884.dkr.ecr.eu-central-1.amazonaws.com/pytorch-inference:2.2.0-gpu-py310-cu118-ubuntu20.04-sagemaker
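For anyone trying to reproduce this, the setup described under "To reproduce" might look roughly like the inference.py below. This is only a minimal sketch, not the reporter's actual script: the report says an initial time.sleep(60) is enough but does not say where it sits, so placing it inside model_fn is an assumption, and the echo "model" and the STARTUP_DELAY_SECONDS name are purely illustrative. model_fn and predict_fn are the handler names the SageMaker PyTorch serving stack looks for in a custom script.

```python
import time

# Assumption: the report only mentions "a simple initial time.sleep(60)";
# putting the delay inside model_fn is a guess at where it was placed.
STARTUP_DELAY_SECONDS = 60


def model_fn(model_dir):
    """Load the model; here it only simulates a slow startup."""
    time.sleep(STARTUP_DELAY_SECONDS)
    # Illustrative placeholder: a "model" that echoes its input.
    # A real script would load weights from model_dir instead.
    return lambda data: data


def predict_fn(input_data, model):
    """Run the (placeholder) model on already-deserialized input."""
    return model(input_data)
```

As noted above, the script would still need to be packaged together with a custom requirements.txt and run as a batch transform job on the image listed here to match the reported conditions.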