openshift / cluster-etcd-operator

Operator to manage the lifecycle of the etcd members of an OpenShift cluster
Apache License 2.0
96 stars 130 forks source link

OCPBUGS-36604: tune event correlation #1310

Closed tjungblu closed 3 months ago

tjungblu commented 3 months ago

The event correlator is too strict and throttles away important events for our invariant test suite. This fine tunes the values to try not to lose those.

openshift-ci-robot commented 3 months ago

@tjungblu: This pull request references Jira Issue OCPBUGS-36604, which is invalid:

Comment /jira refresh to re-evaluate validity if changes to the Jira bug are made, or edit the title of this pull request to link to a different bug.

The bug has been updated to refer to the pull request using the external bug tracker.

In response to [this](https://github.com/openshift/cluster-etcd-operator/pull/1310): >The event correlator is too strict and throttles away important events for our invariant test suite. This fine tunes the values to try not to lose those. Instructions for interacting with me using PR comments are available [here](https://prow.ci.openshift.org/command-help?repo=openshift%2Fcluster-etcd-operator). If you have questions or suggestions related to my behavior, please file an issue against the [openshift-eng/jira-lifecycle-plugin](https://github.com/openshift-eng/jira-lifecycle-plugin/issues/new) repository.
Elbehery commented 3 months ago

/lgtm /approve

tjungblu commented 3 months ago

/hold

openshift-ci[bot] commented 3 months ago

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: Elbehery, tjungblu

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files: - ~~[OWNERS](https://github.com/openshift/cluster-etcd-operator/blob/master/OWNERS)~~ [Elbehery,tjungblu] Approvers can indicate their approval by writing `/approve` in a comment Approvers can cancel approval by writing `/approve cancel` in a comment
tjungblu commented 3 months ago

/retest

tjungblu commented 3 months ago

nope, still a thing

{ static pod lifecycle failure - static pod: "etcd" in namespace: "openshift-etcd" for revision: 3 on node: "ip-10-0-35-140.ec2.internal" didn't show up, waited: 3m30s}

openshift-ci[bot] commented 3 months ago

New changes are detected. LGTM label has been removed.

openshift-ci[bot] commented 3 months ago

@tjungblu: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name Commit Details Required Rerun command
ci/prow/e2e-aws-etcd-certrotation 5b9e04be863fec1388805cdaa15dfe78efef07a3 link false /test e2e-aws-etcd-certrotation
ci/prow/e2e-aws-etcd-recovery 5b9e04be863fec1388805cdaa15dfe78efef07a3 link false /test e2e-aws-etcd-recovery
ci/prow/e2e-metal-ovn-sno-cert-rotation-shutdown 5b9e04be863fec1388805cdaa15dfe78efef07a3 link false /test e2e-metal-ovn-sno-cert-rotation-shutdown
ci/prow/e2e-agnostic-ovn 5b9e04be863fec1388805cdaa15dfe78efef07a3 link true /test e2e-agnostic-ovn
ci/prow/e2e-metal-ovn-ha-cert-rotation-shutdown 5b9e04be863fec1388805cdaa15dfe78efef07a3 link false /test e2e-metal-ovn-ha-cert-rotation-shutdown

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available [here](https://git.k8s.io/community/contributors/guide/pull-requests.md). If you have questions or suggestions related to my behavior, please file an issue against the [kubernetes-sigs/prow](https://github.com/kubernetes-sigs/prow/issues/new?title=Prow%20issue:) repository. I understand the commands that are listed [here](https://go.k8s.io/bot-commands).
openshift-ci-robot commented 3 months ago

@tjungblu: This pull request references Jira Issue OCPBUGS-36604. The bug has been updated to no longer refer to the pull request using the external bug tracker. All external bug links have been closed. The bug has been moved to the NEW state.

In response to [this](https://github.com/openshift/cluster-etcd-operator/pull/1310): >The event correlator is too strict and throttles away important events for our invariant test suite. This fine tunes the values to try not to lose those. Instructions for interacting with me using PR comments are available [here](https://prow.ci.openshift.org/command-help?repo=openshift%2Fcluster-etcd-operator). If you have questions or suggestions related to my behavior, please file an issue against the [openshift-eng/jira-lifecycle-plugin](https://github.com/openshift-eng/jira-lifecycle-plugin/issues/new) repository.