Closed kannon92 closed 2 months ago
@kannon92: This pull request explicitly references no jira issue.
Thanks
/lgtm
/lgtm
[APPROVALNOTIFIER] This PR is APPROVED
This pull-request has been approved by: kannon92, stbenjam
The full list of commands accepted by this bot can be found here.
The pull request process is described here
Good time to consider %d is considered a flake for now
as well?
https://github.com/openshift/origin/pull/29108#issuecomment-2360961536
@neisw I don't agree with that one.
https://github.com/openshift/origin/pull/29108#issuecomment-2360987842
/hold
wait on comments for @neisw
I think @stbenjam main concern was the potential to rename later. But it seems atypical to reference 'flake' in the test name. Looking at sippy for 4.18 tests containing flake it looks like only [sig-architecture] platform pods in ns/%s that restart more than %d is considered a flake for now
variety.
We have done things in the past like this where we flake at first then fail some / flake others later on. Just something to consider if it is still early enough to tweak.
Question about failures we see in presubmits
Is the following an expected failure when the pod is deleted?
namespace/openshift-image-registry node/ip-10-0-20-204.us-west-1.compute.internal pod/node-ca-v94n4 uid/cc459ba2-bb39-482e-891e-fc3ae2a23056 container/node-ca restarted 4 times at:
non-zero exit at 2024-09-19 01:56:13.670067226 +0000 UTC m=+919.297863990: cause/ContainerStatusUnknown code/137 reason/ContainerExit The container could not be located when the pod was deleted. The container used to be Running
non-zero exit at 2024-09-19 02:10:18.40417271 +0000 UTC m=+1764.031969474: cause/ContainerStatusUnknown code/137 reason/ContainerExit The container could not be located when the pod was deleted. The container used to be Running
non-zero exit at 2024-09-19 02:41:34.783221583 +0000 UTC m=+3640.411018367: cause/ContainerStatusUnknown code/137 reason/ContainerExit The container could not be located when the pod was deleted. The container used to be Running
non-zero exit at 2024-09-19 03:07:16.927074536 +0000 UTC m=+5182.554871310: cause/ContainerStatusUnknown code/137 reason/ContainerExit The container could not be located when the pod was deleted. The container used to be Running
Wondering if this might be a case we want to start with flakes and observe the failures first?
Wondering if this might be a case we want to start with flakes and observe the failures first?
already did. This was merged a few weeks back and exceptions added based on ci.search.
/close
PR needs rebase.
@kannon92: Closed this PR.
@kannon92: The following tests failed, say /retest
to rerun all failed tests or /retest-required
to rerun all mandatory failed tests:
Test name | Commit | Details | Required | Rerun command |
---|---|---|---|---|
ci/prow/e2e-openstack-ovn | ee7b16e3255b2b864ffa6f294dc5b0d354301434 | link | false | /test e2e-openstack-ovn |
ci/prow/e2e-aws-ovn-single-node-upgrade | ee7b16e3255b2b864ffa6f294dc5b0d354301434 | link | false | /test e2e-aws-ovn-single-node-upgrade |
ci/prow/e2e-aws-ovn-kube-apiserver-rollout | ee7b16e3255b2b864ffa6f294dc5b0d354301434 | link | false | /test e2e-aws-ovn-kube-apiserver-rollout |
Full PR test history. Your PR dashboard.
cc @stbenjam