Open samjo-nyang opened 2 years ago
Thanks for raising this. We're interested in integrating with Neuron, and it's something we're planning to look into down the road!
Re-titled this to be consistent with #1075, which is similar but for an ECS Inferentia variant.
Is this still needed?
Yes, we are using more neuron instances than I created the ticket. (actively migrating workloads from gpu to neuron)
Container SSA check-in. IHAC is running ML workloads with Inferentia on EKS. They are quite interested in Bottlerocket in terms of awesome security benefits they get with less overhead. They really want to align the company standards to use Bottlerocket for general business application as well as ML workloads. But the lack of support for Inferentia would affect their adoption.
IHAC who is running Stable Diffusion on EKS Inf2, and they wish to adopt Bottlerocket image cache solution to reduce the large image (10+GB) pulling time from ECR around 3-4 minutes. Foreseeing the increasing GenAI model hosting with Inferentia, supporting Inferentia/Neuron runtime will have a big impact.
What I'd like: I think it requires the neuron driver on https://github.com/aws/aws-neuron-sdk
Any alternatives you've considered: Nothing