kubernetes-sigs / cluster-api

Home for Cluster API, a subproject of sig-cluster-lifecycle
https://cluster-api.sigs.k8s.io
Apache License 2.0

flaking unit test in `TestReconcileMachinePoolMachines` #11070

Open cahillsf opened 4 weeks ago

cahillsf commented 4 weeks ago

Which jobs are flaking?

These failures appear in `periodic-cluster-api-test-mink8s-main` and `periodic-cluster-api-test-main`.

Which tests are flaking?

TestReconcileMachinePoolMachines/Reconcile_MachinePool_Machines/Should_create_two_machines_if_two_infra_machines_exist

Since when has it been flaking?

At least since 2024-07-06: https://storage.googleapis.com/k8s-triage/index.html?date=2024-07-20&text=TestReconcileMachinePoolMachines%2FReconcile_MachinePool_Machines%2FShould_create_two_machines_if_two_infra_machines_exist&job=.*cluster-api.*(test%7Ce2e)-(mink8s-)*main&xjob=.*-provider-.*
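
For readers who want to see what the triage link above filters on, here is a minimal Python sketch that decodes its query string and checks the two job names from this report against the `job` and `xjob` patterns. The helper names `triage_filters` and `job_selected` are hypothetical, and treating `job` as an include pattern and `xjob` as an exclude pattern, both applied as unanchored regex searches, is an assumption about how the triage page works rather than something confirmed here.

```python
import re
from urllib.parse import parse_qs, urlsplit

# The triage link from this report, split across lines for readability.
TRIAGE_URL = (
    "https://storage.googleapis.com/k8s-triage/index.html"
    "?date=2024-07-20"
    "&text=TestReconcileMachinePoolMachines%2FReconcile_MachinePool_Machines"
    "%2FShould_create_two_machines_if_two_infra_machines_exist"
    "&job=.*cluster-api.*(test%7Ce2e)-(mink8s-)*main"
    "&xjob=.*-provider-.*"
)


def triage_filters(url):
    """Return the percent-decoded query parameters of a triage link (hypothetical helper)."""
    query = parse_qs(urlsplit(url).query)
    return {key: values[0] for key, values in query.items()}


def job_selected(filters, job_name):
    """Assumption: 'job' includes and 'xjob' excludes, both as unanchored regex searches."""
    included = re.search(filters["job"], job_name) is not None
    excluded = re.search(filters["xjob"], job_name) is not None
    return included and not excluded


filters = triage_filters(TRIAGE_URL)
for name in ("periodic-cluster-api-test-mink8s-main", "periodic-cluster-api-test-main"):
    print(name, job_selected(filters, name))
```

Under these assumptions both periodic jobs named in this report are selected, while a job whose name contains `-provider-` would be filtered out.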

Testgrid link

https://prow.k8s.io/view/gs/kubernetes-jenkins/logs/periodic-cluster-api-test-mink8s-main/1824877164462346240

Reason for failure (if possible)

No response

Anything else we need to know?

No response

Label(s) to be applied

/kind flake

One or more /area label. See https://github.com/kubernetes-sigs/cluster-api/labels?q=area for the list of labels.

cahillsf commented 4 weeks ago

/area machinepool

sbueringer commented 3 weeks ago

Yup. I saw a bunch of flakes around MachinePool unit tests as well

/triage accepted

/help

k8s-ci-robot commented 3 weeks ago

@sbueringer: This request has been marked as needing help from a contributor.

Guidelines

Please ensure that the issue body includes answers to the following questions:

For more details on the requirements of such an issue, please see here and ensure that they are met.

If this request no longer meets these requirements, the label can be removed by commenting with the /remove-help command.

cahillsf commented 2 weeks ago

/assign cahillsf

I cannot reproduce this issue locally. I have opened a draft PR that switches this unit test to what seem to be the preferred methods; see the PR for details. Hopefully this will improve the stability of the test.

sbueringer commented 2 weeks ago

Would be great if some folks familiar with Machine Pools / MachinePool Machines can review #11124

(cc @Jont828 @willie-yao)

sbueringer commented 2 weeks ago

/reopen

I assume we want to keep this issue open for now as we're not sure if the PR will fix all flakes

k8s-ci-robot commented 2 weeks ago

@sbueringer: Reopened this issue.

cahillsf commented 2 weeks ago

> /reopen
>
> I assume we want to keep this issue open for now as we're not sure if the PR will fix all flakes

Yep, sounds good. I will track the test and revisit.


edit: adding k8s-triage link https://storage.googleapis.com/k8s-triage/index.html?text=TestReconcileMachinePoolMachines&job=.*cluster-api-(test%7Ce2e)-(mink8s-)*main