Open juliayakovlev opened 1 year ago
@juliayakovlev maybe adding proper EventFilter to decommission (similar like we do in DropIndex)?
@juliayakovlev
just exclude the node with have running_nemesis
, for the check (adding back the target_node)
@juliayakovlev just exclude the node with have
running_nemesis
, for the check (adding back the target_node)
It won't let up 100% cover. If SLA nemesis will start during decommission (or similar) nemesis and finish after that we won't know about the nemesis (validation is performs in the end)
Scheduler runtime validation failed on node that unbootstraped (during decommission). How we can filter such nodes and not validate it there?
Issue description
Describe your issue in detail and steps it took to produce it.
Impact
Describe the impact this issue causes to the user.
How frequently does it reproduce?
Describe the frequency with how this issue can be reproduced.
Installation details
Kernel Version: 5.15.0-1039-aws Scylla version (or git commit hash):
2022.2.11-20230705.27d29485de90
with build-idf467a0ad8869d61384d8bbc8f20e4fb8fd281f4b
Cluster size: 5 nodes (i4i.2xlarge)
Scylla Nodes used in this run:
OS / Image:
ami-0ce59e86771bcb0ef
(aws: undefined_region)Test:
longevity-sla-system-24h
Test id:0ceb3cea-2cc6-46f1-9d48-95e3cde45bb3
Test name:enterprise-2022.2/Reproducers/longevity-sla-system-24h
Test config file(s):Logs and commands
- Restore Monitor Stack command: `$ hydra investigate show-monitor 0ceb3cea-2cc6-46f1-9d48-95e3cde45bb3` - Restore monitor on AWS instance using [Jenkins job](https://jenkins.scylladb.com/view/QA/job/QA-tools/job/hydra-show-monitor/parambuild/?test_id=0ceb3cea-2cc6-46f1-9d48-95e3cde45bb3) - Show all stored logs command: `$ hydra investigate show-logs 0ceb3cea-2cc6-46f1-9d48-95e3cde45bb3` ## Logs: - **db-cluster-0ceb3cea.tar.gz** - [https://cloudius-jenkins-test.s3.amazonaws.com/0ceb3cea-2cc6-46f1-9d48-95e3cde45bb3/20230814_095238/db-cluster-0ceb3cea.tar.gz](https://cloudius-jenkins-test.s3.amazonaws.com/0ceb3cea-2cc6-46f1-9d48-95e3cde45bb3/20230814_095238/db-cluster-0ceb3cea.tar.gz) - **sct-runner-events-0ceb3cea.tar.gz** - [https://cloudius-jenkins-test.s3.amazonaws.com/0ceb3cea-2cc6-46f1-9d48-95e3cde45bb3/20230814_095238/sct-runner-events-0ceb3cea.tar.gz](https://cloudius-jenkins-test.s3.amazonaws.com/0ceb3cea-2cc6-46f1-9d48-95e3cde45bb3/20230814_095238/sct-runner-events-0ceb3cea.tar.gz) - **sct-0ceb3cea.log.tar.gz** - [https://cloudius-jenkins-test.s3.amazonaws.com/0ceb3cea-2cc6-46f1-9d48-95e3cde45bb3/20230814_095238/sct-0ceb3cea.log.tar.gz](https://cloudius-jenkins-test.s3.amazonaws.com/0ceb3cea-2cc6-46f1-9d48-95e3cde45bb3/20230814_095238/sct-0ceb3cea.log.tar.gz) - **loader-set-0ceb3cea.tar.gz** - [https://cloudius-jenkins-test.s3.amazonaws.com/0ceb3cea-2cc6-46f1-9d48-95e3cde45bb3/20230814_095238/loader-set-0ceb3cea.tar.gz](https://cloudius-jenkins-test.s3.amazonaws.com/0ceb3cea-2cc6-46f1-9d48-95e3cde45bb3/20230814_095238/loader-set-0ceb3cea.tar.gz) - **monitor-set-0ceb3cea.tar.gz** - [https://cloudius-jenkins-test.s3.amazonaws.com/0ceb3cea-2cc6-46f1-9d48-95e3cde45bb3/20230814_095238/monitor-set-0ceb3cea.tar.gz](https://cloudius-jenkins-test.s3.amazonaws.com/0ceb3cea-2cc6-46f1-9d48-95e3cde45bb3/20230814_095238/monitor-set-0ceb3cea.tar.gz) [Jenkins job URL](https://jenkins.scylladb.com/job/enterprise-2022.2/job/Reproducers/job/longevity-sla-system-24h/9/) [Argus](https://argus.scylladb.com/test/c3dd6458-2baf-4938-a03f-370b6365470c/runs?additionalRuns[]=0ceb3cea-2cc6-46f1-9d48-95e3cde45bb3)