onecx-apps / onecx-chat

OneCx chat Management
Apache License 2.0
0 stars 0 forks source link

Check Sagemaker costs #6

Closed michaelgloeckner closed 10 months ago

michaelgloeckner commented 10 months ago

Check sagemaker costs Check if we can scale it up and down to only be used on business hours.

lmitlaender commented 10 months ago

Pricing:

Instance pricing is around 25 - 50% more expensive for sagemaker ml instances compared to ec2 instances of the same type. The percentage seems to be higher for lower cost instances and lower for higher cost ones.

Comparing the Hourly Rate at the example of g4dn instances: Sagemaker: image

EC2: image

Data Prices for Sagemaker are 0,016 USD per GB for Data transfer IN and OUT Data Prices for EC2 are free inbound traffic and around 0,05 to 0,09 USD per GB for Outbound traffic

Sagemaker does reduce the need for a Kubernetes Cluster, but is far less flexible regarding where it can be hosted since it is AWS proprietary. It does however give many QoL Tools for integrating with other AWS Services as a managed service.

Scaling:

Sagemaker sadly doesn't support auto scaling to zero, the minimum instance count for an endpoint always has to be at least one. Otherwise its scaling options are similar to ec2 Auto Scaling.

TL;DR:

Sagemaker can be worth it for very high outbound data usage endpoints, and especially when trying to set up more training related jobs or a full Machine Learning Pipeline from Data to Training to Evaluation to Deployment.

For our use-case of only hosting an endpoint the only really interesting part could be the ability of quickly deploying jumpstart models to test different models.