Closed nv-rborkar closed 5 months ago
MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅
@sgpyc can you review this PR and provide feedback in comments?
Just to clarify, we're scaling the power score based on scaling factor of number of steps to train? Not saying that there shouldn't be a scaling factor, but it's not obvious to me that this is the right way to do it.
We are scaling energy score (time*power) when time gets scaled by a scaling factor. RCP normalization is like penalizing a score which was fast, Energy should get penalized similarly as well.
This scenario was not discussed by power taskforce. PR has the most logical solution but we can discuss more during review & have power taskforce weigh in as well.
@pgmpablo157321 can you please review this once to confirm it doesn't break any of your recent changes to include power & perf in result summary.
In MLPerf Training, performance is sometimes normalized based on a scaling factor in scenarios such as:
Energy (perf *power) should also be normalized in such cases.