Project-HAMi / HAMi

Heterogeneous AI Computing Virtualization Middleware
http://project-hami.io/
Apache License 2.0
963 stars 199 forks source link

In multi-card applications, Device_utilization_desc_of_container metrics only has data on one card. #445

Closed CoderTH closed 2 months ago

CoderTH commented 3 months ago

The template below is mostly useful for bug reports and support questions. Feel free to remove anything which doesn't apply to you and add more information where it makes sense.

1. Issue or feature description

In multi-card applications, Device_utilization_desc_of_container metrics only has data on one card.

ZMSGuEpzJ8

img_v3_02dt_8c2f42eb-6307-49f8-bdc4-f1825301cffg

2. Steps to reproduce the issue

3. Information to attach (optional if deemed irrelevant)

Common error checking:

Additional information that might help better understand your environment and reproduce the bug:

archlitchi commented 3 months ago

you're right, current hami-core only count gpu utilization on GPU0, that's an issue needs to be solved

lengrongfu commented 3 months ago

/assign