Open maulik-modi22 opened 1 month ago
@arendej, @wgordon17 , Could you please assign appropriate labels and seek feedback with PM team?
@wgordon17, Could you please assign appropriate labels and seek feedback with PM team to certify g6.*(L4)
GPU?
Which service is this feature request for? Red Hat OpenShift Service on AWS https://aws.amazon.com/about-aws/whats-new/2024/04/general-availability-amazon-ec2-g6-instances/ https://aws.amazon.com/ec2/instance-types/g6/
What are you trying to do? To run inferences with HighRes VIT model to detect cracks and damages in concrete, we require
Multi GPU VM
Describe the solution you'd like Would like to see EC2 G6 instances that are powered by NVIDIA
L4 Tensor Core GPU
(L4) as supported instance types under Accelerated computing https://docs.openshift.com/rosa/rosa_architecture/rosa_policy_service_definition/rosa-hcp-instance-types.html should haveDescribe alternatives you've considered Other GPUs such as
g5.12xlarge
(A10) andp3.8xlarge
(V100) are too much expensive and cheaper GPU such asg4dn.12xlarge
(T4) do not meet performance requirement. On the other handg6.*
(L4) series offers sweet spot for price/performance.Additional context