Open hpedrorodrigues opened 2 months ago
Hello! Thank you for filing an issue.
The maintainers will triage your issue shortly.
In the meantime, please take a look at the troubleshooting guide for bug reports.
If this is a feature request, please review our contribution guidelines.
I get the same result from canceled runs or when the runner pods failed.
I implemented a bit of a hacky fix by putting parameters in Grafana to ignore certain values above a threshold, but agree that it should be 0 for those runs.
I get the same result and like @Lucas-Hughes it seems to happen when the jobs are cancelled. That's too bad since this metrics is super valuable since we can create alerts to detect slower than usual github jobs ....
Checks
Controller Version
0.9.3
Deployment Method
Helm
Checks
To Reproduce
Describe the bug
In a few cases (don't know exact reason yet) the listener reports the metric
gha_job_execution_duration_seconds_sum
with a wrong value.Example:
Looking at the repository, all runs take less than 60 seconds to finish. The other ones are canceled even before starting because the branch has a new commit.
Describe the expected behavior
Not sure if this is caused only by canceled runs, but I'd expect the listener to return 0 for such runs.
Additional Context
Controller Logs
Runner Pod Logs