Open levipereira opened 1 year ago
I'm going to take a crack at this.
@rmccorm4, what are your thoughts on this feature request? Let me know if you would like me to open a ticket.
@ClifHouck, did you have success with this enhancement? Thanks for working on this!
@dyastremsky Yes, but I ran into this bug: https://github.com/triton-inference-server/server/issues/6815
I've opened a PR to address it: https://github.com/triton-inference-server/core/pull/321
I was waiting for that to be resolved before opening another PR to address this issue.
Thanks for letting me know, Clif. I'll take a look.
@ClifHouck did your efforts on this stall? we have just setup triton and it would be great to have the GPU metrics tagged with GPU #.
Is your feature request related to a problem? Please describe. no
Currently, the triton-server provides GPU utilization metrics in Prometheus format, like so:
I would like to request the inclusion of additional information, specifically the GPU number and GPU name, similar to what can be obtained using nvidia-smi -L. This information would greatly aid in creating dynamic Grafana dashboards without the need to consult additional identification information on the physical host.
Example output of nvidia-smi -L:
Including the GPU number and GPU name in the Prometheus metrics would improve the user experience and ease the dynamic creation of monitoring dashboards.
Thank you for considering this enhancement request.
Best regards, Levi Pereira