CapacityReservations/ Capacity Blocks for ML(GPU) in AWS to get allocation/ significant discounts.

kubernetes-sigs / cluster-api-provider-aws

Kubernetes Cluster API Provider AWS provides consistent deployment and day 2 operations of "self-managed" and EKS Kubernetes clusters on AWS.

Apache License 2.0

646 stars 575 forks source link

/kind feature

Describe the solution you'd like [A clear and concise description of what you want to happen.]

By using capacity blocks for ML, one can obtain a significant discount compared to on-demand GPU instances. However, we can also use CapacityReservations to allocate additional on-demand instances to the cluster in case of poor availability of on-demand instances.

Anything else you would like to add: [Miscellaneous information that will assist in solving the issue.] If the user uses the on-demand CapacityReservationId to the cluster if the reservation expires the cluster falls into normal on-demand instances. But for the Capacity Blocks, the instances will start deleted as these are GPU instances which need to be allocated to other users.

Environment:

Cluster-api-provider-aws version:
Kubernetes version: (use kubectl version):
OS (e.g. from /etc/os-release):

kubernetes-sigs / cluster-api-provider-aws

CapacityReservations/ Capacity Blocks for ML(GPU) in AWS to get allocation/ significant discounts. #5045