dell / omnia

An open-source toolkit for deploying and managing high performance clusters for HPC, AI, and data analytics workloads.
https://omnia-doc.readthedocs.io/en/latest/index.html
Apache License 2.0
224 stars 118 forks source link

Release 1.7 install habana device plugin with node selector #2311

Closed dweineha closed 2 months ago

dweineha commented 2 months ago

Label nodes that has Gaudi accelerators installed, and use a node-selector when deciding where to deploy the device plugin.