Open sebastien-rosset opened 1 year ago
@sebastien-rosset I didn't see this issue, sorry!
You're totally right, there's no need for hw.gpu.power
, this one should be removed from the specification.
WRT in/out, I agree too and we were considering adding a direction attribute, instead of a separate metric:
hw.power{direction="in"}
for anything that consumes energy
hw.power{direction="out"}
for anything that outputs energy, like a battery, a smart plug or a UPS.
Actually, we could have that at the level too, for UPS systems: hw.host.power{direction="out"}
.
Another option was to simply say that power that is "produced" should be represented as a negative value for hw.power
. This way, one could aggregate all hw.power
metrics to get the overall power consumed in their data center/room, and not deal with calculations like sum(hw.power{direction="in"}) - sum(hw.power{direction="out"})
. It's probably simpler in this case, but a little counter-intuitive to see negative values for a power metric... 🤷♂️
@bogdandrutu This issue must be moved to the semantic conventions repository. Thank you!
What are you trying to achieve?
Improve the spec to provide guidelines for:
hw.host.power
versushw.power
metric.hw.host.energy
versushw.energy
metric.What did you expect to see?
Better guidelines for
hw.host.power
andhw.power
metrics.Additional context.
Use
hw.power
instead ofhw.gpu.power
I see
hw.gpu.power
has specifically been defined for GPUs. Shouldn't GPU power be standardized to usehw.power
?Device with In/Out power
Some devices transfer some of the energy they receive, i.e., they have IN and OUT power. What metrics and attributes should be used to report power/energy utilization? In particular, how does one report output power?
A possible solution is to add metrics to report both input and output power:
hw.power
hw.power_out
hw.energy
hw.energy_out
hw.power
could potentially be renamed tohw.power_in
.For example:
hw.power
= 148W andhw.power_out
= 122W.hw.power
= 121W.hw.power
- SUM ofhw.power_out
across components indicates how much power is drawn without double counting the power.hw.host.power
reports 5,020 Watts, and each connected device reportshw.host.power
with 500 Watts, then in aggregate the power is double counted.Smart meters
Smart meters can report the power utilization. For example, what metrics should a house smart meter report?
Reporting both
hw.host.power
andhw.power
metricsSuppose a physical system has:
For example, a physical server has power supply units (PSUs), CPUs, DIMMs, disks, GPU, PCI components, etc. Each of these consume energy and typically have sensors that can report power utilization. The power supply units can report the total energy consumed by the host, and each sub-component can have an instrument that reports the power utilization of that component.
Questions/Issues:
hw.host.power
metric for the whole system to be the sum of all PSUs power usage?hw.power
metric is used to report power usage for each of the sub-components?hw.power
metric?hw.host.power
is greater than or equal to the sum of thehw.power
across sub-components. But if PSU reportshw.power
, that may double count power usage for the sub-components unless somehow we can distinguish between input and output PSU power.