ucbrise / clipper

A low-latency prediction-serving system
http://clipper.ai
Apache License 2.0
1.4k stars 280 forks source link

[Metrics] Extend metrics to measure physical performance stats #421

Open dcrankshaw opened 6 years ago

dcrankshaw commented 6 years ago

It would be great to extend our monitoring infrastructure to measure more physical performance. To start with, @blackhat06 suggested tracking the following resource metrics:

simon-mo commented 6 years ago

Prometheus can track these with node exporter https://github.com/prometheus/node_exporter/blob/master/README.md

For Kubernetes we can just scrape kube-api-server/metrics. Kubernetes expose Prometheus metrics there

simon-mo commented 6 years ago

Update:

dcrankshaw commented 6 years ago

Awesome. cc @blackhat06

rkooo567 commented 5 years ago

@simon-mo Is this handled?