aws-neuron / aws-neuron-sdk

Powering AWS purpose-built machine learning chips. Blazing fast and cost effective, natively integrated into PyTorch and TensorFlow and integrated with your favorite AWS services
https://aws.amazon.com/machine-learning/neuron/
Other
423 stars 136 forks source link

[Feature]: Neuron Helm Chart for Kubernetes(Amazon EKS) Deployment #707

Open vara-bonthu opened 12 months ago

vara-bonthu commented 12 months ago

What's the problem:

Presently, deploying Neuron plugins for k8s requires manual configuration using YAML files, which can be time-consuming and error-prone. Introducing a Helm chart will streamline the deployment process and offer a more standardized and manageable approach.

Requested Scope:

Helm chart specifically tailored for deploying Neuron plugins on Kubernetes(Amazon EKS) clusters. The chart should include necessary configurations, dependencies, and documentation to guide users through the deployment process. Regular updates to the chart aligned with the Neuron project releases would also be highly beneficial.

Helm chart for Neuron Device Plugin and Neuron Scheduler

micwade-aws commented 11 months ago

Thanks for submitting the feature request. We did not have this in the plan, but will add to the backlog and evaluate soon.

vara-bonthu commented 11 months ago

I have created a local Helm Chart for Trainium on EKS DoEKS Terraform Blueprint. This Helm chart is tested and works well.

I can raise a PR if you would like to host this helm chart in this repo. We may need to Github pipeline to publish the new charts when the new Neuron Device Plugin image is published.