Open yangchiu opened 2 days ago
@mantissahz Please help investigate the issue. Thank you.
Could this be related to https://github.com/longhorn/longhorn/issues/9781?
Could this be related to #9781?
Yes, it seems to be a regression failure caused by it. I will handle this at https://github.com/longhorn/longhorn/issues/9781.
cc @derekbit @mantissahz
[x] Where is the reproduce steps/test steps documented? The reproduce steps/test steps are at:/
[x] ~Is there a workaround for the issue? If so, where is it documented? The workaround is at:~
[x] Does the PR include the explanation for the fix or the feature?
[x] ~Does the PR include deployment change (YAML/Chart)? If so, where are the PRs for both YAML file and Chart?~ The PR for the YAML change is at: The PR for the chart change is at:
[ ] Have the backend code been merged (Manager, Engine, Instance Manager, BackupStore etc) (including backport-needed/*
)?
The PR is at:
[x] Which areas/issues this PR might have potential impacts on? Area replica scheduling, node eviction Issues
[x] ~If labeled: require/LEP Has the Longhorn Enhancement Proposal PR submitted?~ The LEP PR is at
[x] ~If labeled: area/ui Has the UI issue filed or ready to be merged (including backport-needed/*
)?~
The UI issue/PR is at
[x] ~If labeled: require/doc Has the necessary document PR submitted or merged (including backport-needed/*
)?~
The documentation issue/PR is at
[x] ~If labeled: require/automation-e2e Has the end-to-end test plan been merged? Have QAs agreed on the automation test case? If only test case skeleton w/o implementation, have you created an implementation issue (including backport-needed/*
)~
The automation skeleton PR is at
The automation test case PR is at
The issue of automation test case implementation is at (please create by the template)
[x] ~If labeled: require/automation-engine Has the engine integration test been merged (including backport-needed/*
)?~
The engine automation PR is at
[x] ~If labeled: require/manual-test-plan Has the manual test plan been documented?~ The updated manual test plan is at
[x] ~If the fix introduces the code for backward compatibility Has a separate issue been filed with the label release/obsolete-compatibility
?~
The compatibility issue is filed at
Could this be related to #9781?
Yes, it seems to be a regression failure caused by it. I will handle this at #9781.
cc @derekbit @mantissahz
so this is not a regression in the existing versions but caused by the recent fix for #9781 ?
so this is not a regression in the existing versions but caused by the recent fix for #9781 ?
Yes, this is caused by a recently merged PR. https://github.com/longhorn/longhorn-manager/pull/3270
Describe the bug
Test case test_node_eviction_multiple_volume failed to reschedule replicas after volume detached:
https://ci.longhorn.io/job/public/job/master/job/sles/job/amd64/job/longhorn-tests-sles-amd64/1104/testReport/junit/tests/test_node/test_node_eviction_multiple_volume/
To Reproduce
In
v1.7.2
, the detached volume will automatically re-attach in step 9 to reschedule replicas from node 1 to node 2.But in
master-head
, the re-attachment and rescheduling never happen.Expected behavior
Support bundle for troubleshooting
Environment
Additional context
Workaround and Mitigation