nerc-project / operations

Issues related to the operation of the NERC OpenShift environment
2 stars 0 forks source link

Add IBM Autopilot metrics, alerts, and dashboards to GPU clusters #769

Closed computate closed 1 month ago

computate commented 1 month ago

We will configure the IBM Autopilot helm chart resources in nerc-ocp-config and deploy Autopilot to the test and prod clusters where GPUs are running. We will also deploy the Autopilot GrafanaDashboard to the obs cluster where the nerc-logs-metrics team can access the dashboard.