-
**How to categorize this issue?**
/area control-plane
/kind enhancement
/priority 3
**What would you like to be added**:
DWD today checks if the % of expired node leases is above a configure…
-
- [ ] Health Status Checker ([chat](https://sourcegraph.slack.com/archives/C02E4HE42BX/p1663698348898329), [doc](https://docs.google.com/document/d/1shS8DZkZXMB-T4rKwLMogU70N4mSyOpDG1G7ElkjHD4/edit#he…
-
This idea came up when I was writing a configuration file that uses the `subprojects` section. For each of my subprojects, I was defining an alternate `output_dir` and `sample_annotation`, but I was n…
-
Now that Solution Checker is an accepted part of the ALM process, I am finding more and more situations where a clean bill of health is expected from Solution Checker as part of the release pipeline. …
-
HealthBit version: 0.1.8
Currently only the first failed check is returned. It would be nice to have an option to continue beyond the first failed check and report all failures.
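
As a rough illustration of the requested behavior, here is a minimal sketch in Python. It is not HealthBit's actual API: `run_checks`, the `fail_fast` flag, and the check names are all hypothetical, and it assumes each check is a callable that returns a truthy value on success or raises on failure.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical types for illustration only; not part of HealthBit.
Check = Callable[[], object]
Failure = Tuple[str, str]  # (check name, reason)


def run_checks(checks: Dict[str, Check], fail_fast: bool = True) -> List[Failure]:
    """Run checks in insertion order and return the failures.

    With fail_fast=True the run stops at the first failed check (the
    behavior described above as the current one). With fail_fast=False
    every check runs and all failures are reported.
    """
    failures: List[Failure] = []
    for name, check in checks.items():
        try:
            ok = bool(check())
            reason = "check returned a falsy value"
        except Exception as exc:  # a raising check counts as a failure
            ok = False
            reason = f"{type(exc).__name__}: {exc}"
        if not ok:
            failures.append((name, reason))
            if fail_fast:
                break
    return failures


if __name__ == "__main__":
    demo = {"db": lambda: True, "cache": lambda: False}
    for name, reason in run_checks(demo, fail_fast=False):
        print(f"{name}: {reason}")
```

Keeping the current stop-at-first-failure behavior as the default and making "report everything" opt-in would avoid changing results for existing users.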
-
**Describe the bug**
I have a multicluster setup with a separate monitoring cluster. For metrics querying I use Thanos Query and it works fine: the in-cluster Robusta runner can connect through Thanos Query s…
-
Instead of handling these events in each service, there should be a module that takes care of them.
This module should handle the transitions between run statuses.
Include generic logic such as the pause functio…
-
The following links should probably be deleted and/or replaced with alternatives:
https://github.com/DanTheMan827/ios-app-signer
https://github.com/DerekSelander/dsdump
https://github.com/Domilop…
-
We are suddenly seeing some errors in our workflows with the following messages:
```
Error uploading to https://github-production-release-asset-xxxxxx.s3.amazonaws.com: 403
```
Nothing has chan…
-
## Describe the bug
The health check loop starts before a service's start-up sequence, which means it logs an error on start-up and reports the service as down. Since monitoring is based on the aver…