NVIDIA / cloud-native-docs

Documentation repository for NVIDIA Cloud Native Technologies
https://docs.nvidia.com/datacenter/cloud-native/
Apache License 2.0
16 stars 18 forks source link

Exclude NFD master from metrics #38

Closed mikemckiernan closed 6 months ago

mikemckiernan commented 6 months ago

Addresses https://github.com/NVIDIA/cloud-native-docs/issues/1.

mikemckiernan commented 6 months ago

Worker also has some metrics https://github.com/kubernetes-sigs/node-feature-discovery/blob/master/pkg/nfd-worker/metrics.go

The update just prevents attempting to scrape metrics from a service that isn't publishing them. I miraculously had a cluster with Prometheus configured and witnessed the better experience--no "down" status for services in the gpu-metrics target. TY, btw.