bentoml / aws-sagemaker-deploy

Fast model deployment on AWS Sagemaker
Apache License 2.0
15 stars 15 forks source link

Auto-scaling #23

Open NaxAlpha opened 2 years ago

NaxAlpha commented 2 years ago

Hi, I want to ask if it is possible to enable Auto-scaling for SageMaker inference? Thanks

jjmachan commented 2 years ago

hey, @NaxAlpha thanks for bringing this up. We are actively working on adding this into the tool and it should be ready in the coming weeks. Also, I was wondering if you were on the slack group? If you give me your slack id we could maybe have a chat to get a better understanding of your use case.

NaxAlpha commented 2 years ago

Awesome thanks. Here is my id: nauman.mustafa.x@gmail.com

github-actions[bot] commented 2 years ago

Beep boop! 🤖 This issue hasn't had any activity in a while. I'll close it if I don't hear back soon.

jjmachan commented 2 years ago

a PR for reference https://github.com/bentoml/aws-sagemaker-deploy/pull/25

TalhaRB commented 9 months ago

is there any progress on this? Does the Sagemaker support autoscaling endpoint?