openshift-cs / managed-openshift

Public roadmaps for the Red Hat Managed OpenShift offerings OpenShift Dedicated (OSD) and Red Hat OpenShift Service on AWS (ROSA)
Apache License 2.0

ROSA(4.14) needs support for Accelerated computing > Amazon EC2 G6 instances powered by NVIDIA L4 Tensor Core GPU launched in April 2024 #157

Open maulik-modi22 opened 1 month ago

maulik-modi22 commented 1 month ago

Which service is this feature request for?
Red Hat OpenShift Service on AWS
https://aws.amazon.com/about-aws/whats-new/2024/04/general-availability-amazon-ec2-g6-instances/
https://aws.amazon.com/ec2/instance-types/g6/

What are you trying to do?
We want to run inference with a high-resolution ViT model to detect cracks and damage in concrete, which requires a multi-GPU VM.

Describe the solution you'd like
We would like EC2 G6 instances, which are powered by NVIDIA L4 Tensor Core GPUs (L4), to be listed as supported instance types under Accelerated computing in the ROSA instance types documentation: https://docs.openshift.com/rosa/rosa_architecture/rosa_policy_service_definition/rosa-hcp-instance-types.html

Describe alternatives you've considered
Other GPU instances such as g5.12xlarge (A10) and p3.8xlarge (V100) are too expensive, and cheaper GPU instances such as g4dn.12xlarge (T4) do not meet our performance requirements. The g6.* (L4) series, on the other hand, offers a sweet spot for price/performance.
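For anyone who wants to reproduce the price comparison, here is a minimal sketch of the arithmetic behind it: hourly instance price divided by GPU count. The `GpuInstance` type and both function names are hypothetical helpers, the GPU counts are my reading of the AWS instance type pages (please verify them), and prices are deliberately left as caller input because they vary by region and over time.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GpuInstance:
    name: str
    gpu_model: str
    gpu_count: int


# Assumption: GPU counts taken from the AWS instance type pages; verify before use.
CANDIDATES = [
    GpuInstance("g4dn.12xlarge", "T4", 4),
    GpuInstance("g5.12xlarge", "A10", 4),
    GpuInstance("g6.12xlarge", "L4", 4),
    GpuInstance("p3.8xlarge", "V100", 4),
]


def cost_per_gpu_hour(instance: GpuInstance, hourly_price_usd: float) -> float:
    """Hourly instance price divided by the number of GPUs on the instance."""
    return hourly_price_usd / instance.gpu_count


def rank_by_gpu_cost(prices: dict[str, float]) -> list[tuple[str, float]]:
    """Rank the candidates that have a price, cheapest per GPU-hour first.

    `prices` maps instance type name to an hourly price supplied by the caller.
    """
    ranked = [
        (inst.name, cost_per_gpu_hour(inst, prices[inst.name]))
        for inst in CANDIDATES
        if inst.name in prices
    ]
    return sorted(ranked, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Placeholder numbers for illustration only, not real AWS prices.
    placeholder_prices = {"g6.12xlarge": 4.0, "p3.8xlarge": 8.0}
    for name, cost in rank_by_gpu_cost(placeholder_prices):
        print(f"{name}: {cost:.2f} USD per GPU-hour")
```

Cost per GPU-hour is only half of the picture. The performance side (whether a given GPU meets the model's throughput and memory needs) still has to be measured on the actual workload.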

Additional context

[Image: price-comparison]

maulik-modi22 commented 1 month ago

@arendej, @wgordon17, could you please assign the appropriate labels and seek feedback from the PM team?

maulik-modi22 commented 3 weeks ago

@wgordon17, could you please assign the appropriate labels and seek feedback from the PM team on certifying the g6.* (L4) GPU instances?