Open vprashar2929 opened 3 weeks ago
🤖 SeineSailor
Here is a concise summary of the pull request changes:
Summary: This pull request introduces a new feature to validate Kepler metrics in the validator
module, enhancing its capabilities for handling and validating metrics. The changes include:
- Adds a `validate_metrics` command to the `validator` CLI, with `--duration` and `--report-dir` options.
- Updates the `PrometheusJob` named tuple and the `load` function to include new fields and initialize them when loading configuration from a file.
- Adds `validate_metrics`, `ScriptResult`, and `write_md_report`, plus an optional dependency on `click.exceptions.Exit`.
- Adds a script, `regression-stressor.sh`, that simulates CPU load and measures its impact on the system using the `stress-ng` tool.

Impact: These changes expand the `validator` module's capabilities for handling and validating Kepler metrics, but do not affect the signatures of exported functions or global data structures. However, running the new script may affect overall system behavior.
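To make the `PrometheusJob` and `load` change easier to picture, here is a minimal sketch of a named tuple that gains fields with defaults, so that older configuration entries still load. The field names (`url`, `rate_interval`, `step`), their default values, and the `load_job` helper are assumptions for illustration; the summary does not list the real ones.

```python
from typing import Any, Mapping, NamedTuple


class PrometheusJob(NamedTuple):
    """Hypothetical shape; the real field names are not given in this summary."""

    name: str
    url: str
    # Stand-ins for the "new fields": defaults apply when the config omits them.
    rate_interval: str = "20s"
    step: str = "3s"


def load_job(name: str, raw: Mapping[str, Any]) -> PrometheusJob:
    """Build a job from one parsed config mapping, filling defaults for absent keys."""
    # Keep only keys that match a field; unknown keys are ignored.
    known = {k: raw[k] for k in PrometheusJob._fields if k in raw and k != "name"}
    if "url" not in known:
        raise ValueError(f"job {name!r} is missing required key 'url'")
    return PrometheusJob(name=name, **known)


# An entry written before the new fields existed still loads, with defaults.
old_style = load_job("latest", {"url": "http://localhost:9090"})
# A newer entry overrides one of the new fields.
new_style = load_job("dev", {"url": "http://localhost:9091", "step": "5s"})
print(old_style)
print(new_style)
```

Giving the new fields defaults is one way to keep existing configuration files valid; whether the PR does this or requires the new keys is not stated here.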
Observations and Suggestions:
- Verify that the `validate_metrics` command and the `regression-stressor.sh` script work as expected.
- … `validator` module or Kepler metrics.
This commit introduces functionality to validate essential metrics produced by Kepler. The following comparisons are included:

- Node Exporter Comparison: `node_rapl_<package|core|dram>` metrics against `kepler_node_<package|core|dram>{dev}`
- Kepler Process Comparison: `kepler_process_<package|core|dram|platform|other|uncore>{latest}` metrics to `kepler_process_<package|core|dram|platform|other|uncore>{dev}`
- Kepler Node Comparison: `kepler_node_<package|core|dram|platform|other|uncore>{latest}` against `kepler_node_<package|core|dram|platform|other|uncore>{dev}`
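Each comparison above comes down to scoring one sampled series against another. A rough sketch of such a score follows, using mean absolute error and mean absolute percentage error. The `compare_series` helper and the sample values are made up for illustration; the PR may use different error measures.

```python
from statistics import fmean
from typing import Sequence, Tuple


def compare_series(expected: Sequence[float], actual: Sequence[float]) -> Tuple[float, float]:
    """Return (mean absolute error, mean absolute percentage error in %)."""
    if not expected or len(expected) != len(actual):
        raise ValueError("series must be non-empty and of equal length")
    pairs = list(zip(expected, actual))
    mae = fmean(abs(e - a) for e, a in pairs)
    # Samples with a zero baseline are skipped to avoid dividing by zero.
    ratios = [abs(e - a) / abs(e) for e, a in pairs if e != 0]
    mape = 100 * fmean(ratios) if ratios else float("nan")
    return mae, mape


# Hypothetical samples: a baseline series (e.g. {latest}) and a candidate ({dev}).
baseline = [10.0, 20.0, 40.0]
candidate = [11.0, 18.0, 40.0]
mae, mape = compare_series(baseline, candidate)
print(f"MAE={mae:.2f} MAPE={mape:.2f}%")
```

This assumes both series were sampled at the same timestamps; aligning them is left out of the sketch.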
Additionally, a stressor script has been added to generate system load, allowing Kepler to be validated in real time under stress conditions.
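The contents of `regression-stressor.sh` are not shown here, so the following is only a sketch of the general idea in Python: build one `stress-ng` invocation per step of a CPU load curve. The load steps, step duration, and helper names are assumptions; `--cpu`, `--cpu-load`, and `--timeout` are standard `stress-ng` options.

```python
import os
import shutil
import subprocess
from typing import List, Optional, Sequence


def stress_commands(load_steps: Sequence[int], seconds_per_step: int,
                    cpus: Optional[int] = None) -> List[List[str]]:
    """Return one stress-ng argv per load step (load given in percent)."""
    workers = cpus or os.cpu_count() or 1
    commands = []
    for load in load_steps:
        if not 0 <= load <= 100:
            raise ValueError(f"load must be within 0..100, got {load}")
        commands.append([
            "stress-ng",
            "--cpu", str(workers),        # number of CPU stress workers
            "--cpu-load", str(load),      # target load per worker, in percent
            "--timeout", f"{seconds_per_step}s",
        ])
    return commands


def run_load_curve(commands: Sequence[Sequence[str]]) -> None:
    """Run each step in order; requires stress-ng on PATH."""
    if shutil.which("stress-ng") is None:
        raise RuntimeError("stress-ng is not installed")
    for argv in commands:
        subprocess.run(list(argv), check=True)


# Hypothetical ramp up and back down; nothing runs until run_load_curve is called.
curve = stress_commands([0, 50, 100, 50, 0], seconds_per_step=20, cpus=4)
for argv in curve:
    print(" ".join(argv))
```

Keeping command construction separate from execution lets the load curve be inspected or logged before any load is applied.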