Open mrniranjan opened 2 weeks ago
@mrniranjan: This pull request references Jira Issue OCPBUGS-39379, which is valid. The bug has been moved to the POST state.
Requesting review from QA contact: /cc @mrniranjan
The bug has been updated to refer to the pull request using the external bug tracker.
@openshift-ci-robot: GitHub didn't allow me to request PR reviews from the following users: mrniranjan.
Note that only openshift members and repo collaborators can review this PR, and authors cannot review their own PRs.
Requesting review from QA contact: /cc @mrniranjan
The bug has been updated to refer to the pull request using the external bug tracker.
In response to [this](https://github.com/openshift/cluster-node-tuning-operator/pull/1153): >Automates OCPBUGS-34812: cgroupsv2: failed to write on cpuset.cpus.exclusive > >To reproduce the bug, we need to create and delete deployment(deploying guaranteed pods with cpu load balancing annotation) in quick succession, so that we do not fully wait for the cleanup causing the pod about to be deleted to still have access to exclusive cpus causing the new pod to fail because cpuset.cpus.exclusive is not yet freed. > >As the pre-start hook fails to write to cpuset.cpus.exclusive file in the pods cgroup pod goes to RunContainerError state. > >This automation PR creates and deletes deployment in loop to reproduce the issue and checks if the pods fails with Runtime error with message "failed to run pre-start hook for container" > >Manual backport of PR#1127 Instructions for interacting with me using PR comments are available [here](https://prow.ci.openshift.org/command-help?repo=openshift%2Fcluster-node-tuning-operator). If you have questions or suggestions related to my behavior, please file an issue against the [openshift-eng/jira-lifecycle-plugin](https://github.com/openshift-eng/jira-lifecycle-plugin/issues/new) repository.
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.
@mrniranjan: This pull request references Jira Issue OCPBUGS-39379, which is valid.
Requesting review from QA contact: /cc @mrniranjan
The bug has been updated to refer to the pull request using the external bug tracker.
@openshift-ci-robot: GitHub didn't allow me to request PR reviews from the following users: mrniranjan.
Note that only openshift members and repo collaborators can review this PR, and authors cannot review their own PRs.
Requesting review from QA contact: /cc @mrniranjan
The bug has been updated to refer to the pull request using the external bug tracker.
In response to [this](https://github.com/openshift/cluster-node-tuning-operator/pull/1153): >Automates OCPBUGS-34812: cgroupsv2: failed to write on cpuset.cpus.exclusive > >To reproduce the bug, we need to create and delete deployment(deploying guaranteed pods with cpu load balancing annotation) in quick succession, so that we do not fully wait for the cleanup causing the pod about to be deleted to still have access to exclusive cpus causing the new pod to fail because cpuset.cpus.exclusive is not yet freed. > >As the pre-start hook fails to write to cpuset.cpus.exclusive file in the pods cgroup pod goes to RunContainerError state. > >This automation PR creates and deletes deployment in loop to reproduce the issue and checks if the pods fails with Runtime error with message "failed to run pre-start hook for container" > >Manual backport of #1127 Instructions for interacting with me using PR comments are available [here](https://prow.ci.openshift.org/command-help?repo=openshift%2Fcluster-node-tuning-operator). If you have questions or suggestions related to my behavior, please file an issue against the [openshift-eng/jira-lifecycle-plugin](https://github.com/openshift-eng/jira-lifecycle-plugin/issues/new) repository.
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.
@mrniranjan: all tests passed!
Full PR test history. Your PR dashboard.
/approve /label backport-risk-assessed
[APPROVALNOTIFIER] This PR is APPROVED
This pull-request has been approved by: mrniranjan, yanirq
The full list of commands accepted by this bot can be found here.
The pull request process is described here
/lgtm
@mrniranjan: you cannot LGTM your own PR.
/label cherry-pick-approved
Automates OCPBUGS-34812: cgroupsv2: failed to write on cpuset.cpus.exclusive
To reproduce the bug, we need to create and delete deployment(deploying guaranteed pods with cpu load balancing annotation) in quick succession, so that we do not fully wait for the cleanup causing the pod about to be deleted to still have access to exclusive cpus causing the new pod to fail because cpuset.cpus.exclusive is not yet freed.
As the pre-start hook fails to write to cpuset.cpus.exclusive file in the pods cgroup pod goes to RunContainerError state.
This automation PR creates and deletes deployment in loop to reproduce the issue and checks if the pods fails with Runtime error with message "failed to run pre-start hook for container"
Manual backport of #1127