Open dshahrokhian opened 4 years ago
Thanks for reaching out! This feature is only supported on CPU instance types.
We'll fix the documentation to clarify this. Thanks for bringing it to our attention!
Internal Reference: SIMT-P33776646
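Until the documentation is updated, a client-side guard can catch this before an endpoint is created. The snippet below is only a minimal sketch: the `ASSUMED_GPU_FAMILIES` set and both helper names are hypothetical and not part of the SageMaker SDK, and the family list is illustrative rather than exhaustive. It reflects the restriction as stated in this thread (multi-model mode on CPU instance types only).

```python
# Instance families assumed to be GPU-backed. This set is an illustrative
# assumption for the sketch, not an official or exhaustive list.
ASSUMED_GPU_FAMILIES = {"p2", "p3", "g4dn"}


def instance_family(instance_type: str) -> str:
    """Return the family part of an instance type, e.g. 'p3' for 'ml.p3.2xlarge'."""
    parts = instance_type.split(".")
    if len(parts) != 3 or parts[0] != "ml":
        raise ValueError(f"unexpected instance type format: {instance_type!r}")
    return parts[1]


def check_multi_model_instance(instance_type: str) -> None:
    """Raise early if multi-model mode is requested on an assumed GPU family."""
    family = instance_family(instance_type)
    if family in ASSUMED_GPU_FAMILIES:
        raise ValueError(
            f"{instance_type} looks like a GPU instance type; per this thread, "
            "multi-model mode is only supported on CPU instance types"
        )


# A CPU instance type passes silently.
check_multi_model_instance("ml.c5.2xlarge")
```

Calling `check_multi_model_instance("ml.p3.2xlarge")` raises a `ValueError` locally instead of failing later at endpoint creation.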
@knakad We are trying to create an ml.p3.2xlarge instance but are facing the same issue.
Describe the bug
How can we increase prediction speed with SageMaker + MMS without a GPU?
We have a model served with MMS on an ml.c5.2xlarge instance, but a prediction with Detectron2 takes around 30-35s. We want to reduce this time, so we are trying GPU instances, but SageMaker does not allow GPU instances with MMS. When tested on Google Colab, the same prediction takes only 3-4s.
What options do we have? Can you please shed some light on this?
We also tried c5.9xlarge but didn't see much improvement.
System information
SageMaker Python SDK version: latest
Framework name (eg. PyTorch) or algorithm (eg. Faster RCNN): Detectron2
Framework version: detectron2==0.2.1+cu101
Python version: 3.6
CPU or GPU: GPU
Custom Docker image (Y/N): Y
Thanks
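For comparing instance sizes such as the ones mentioned above, it can help to measure the same request several times rather than relying on a single run. The following is a rough sketch only: `time_calls` and `fake_predict` are hypothetical names, and `fake_predict` is a stand-in that would be replaced with whatever function actually sends the request to the endpoint.

```python
import statistics
import time


def time_calls(fn, payload, repeats=5):
    """Call fn(payload) several times and summarize wall-clock latency in seconds."""
    durations = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn(payload)
        durations.append(time.perf_counter() - start)
    return {
        "min": min(durations),
        "median": statistics.median(durations),
        "max": max(durations),
    }


def fake_predict(payload):
    """Stand-in for a real endpoint call; it only sleeps briefly."""
    time.sleep(0.01)
    return {"ok": True}


stats = time_calls(fake_predict, b"image-bytes", repeats=3)
print(stats)
```

The first call against a freshly loaded model is often slower than later ones, so reporting the median alongside min and max gives a fairer comparison between instance types.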
Describe the bug
Apparently, MultiModel mode is not supported on any of the GPU instance types. This is not mentioned anywhere in the documentation.
To reproduce
System information
A description of your system. Please provide: