Open dimakr opened 5 days ago
I did a quick check, in cloud usage reports, of how many QA instances we were having running in Azure during a few recent 2024.1 patch release testing phases (was checking only dates for 2024.1 releases, as the issue occurred during 2024.1.11 testing).
Usually it is executing on Sunday and we have 15-20 running QA instances (at the time the cloud usage report is collected), but on 2024-09-29, when this issue occurred, we had 31 running QA instance (and 43 total). So maybe its just a coincidence, when that many instances were running at the same time. Or maybe some new tests/configs were added for Azure and we need to increase quota for cores
2 out of 3 builds of rolling-upgrade-azure-image-test test for 2024.1.11 patch release failed on provisioning db instances with the following error:
Installation details
Cluster size: 4 nodes (Standard_L8s_v3)
Scylla Nodes used in this run: No resources left at the end of the run
OS / Image:
/CommunityGalleries/scylladb-7e8d8a04-23db-487d-87ec-0e175c0615bb/Images/scylla-enterprise-2023.1/Versions/2023.1.11
(azure: undefined_region)Test:
rolling-upgrade-azure-image-test
Test id:a2350a9d-188d-4cfa-856b-716dea14cf91
Test name:enterprise-2024.1/rolling-upgrade/rolling-upgrade-azure-image-test
Test method:upgrade_test.UpgradeTest.test_rolling_upgrade
Test config file(s):Logs and commands
- Restore Monitor Stack command: `$ hydra investigate show-monitor a2350a9d-188d-4cfa-856b-716dea14cf91` - Restore monitor on AWS instance using [Jenkins job](https://jenkins.scylladb.com/view/QA/job/QA-tools/job/hydra-show-monitor/parambuild/?test_id=a2350a9d-188d-4cfa-856b-716dea14cf91) - Show all stored logs command: `$ hydra investigate show-logs a2350a9d-188d-4cfa-856b-716dea14cf91` ## Logs: - **sct-runner-events-a2350a9d.tar.gz** - [https://cloudius-jenkins-test.s3.amazonaws.com/a2350a9d-188d-4cfa-856b-716dea14cf91/20240929_051823/sct-runner-events-a2350a9d.tar.gz](https://cloudius-jenkins-test.s3.amazonaws.com/a2350a9d-188d-4cfa-856b-716dea14cf91/20240929_051823/sct-runner-events-a2350a9d.tar.gz) - **sct-a2350a9d.log.tar.gz** - [https://cloudius-jenkins-test.s3.amazonaws.com/a2350a9d-188d-4cfa-856b-716dea14cf91/20240929_051823/sct-a2350a9d.log.tar.gz](https://cloudius-jenkins-test.s3.amazonaws.com/a2350a9d-188d-4cfa-856b-716dea14cf91/20240929_051823/sct-a2350a9d.log.tar.gz) [Jenkins job URL](https://jenkins.scylladb.com/job/enterprise-2024.1/job/rolling-upgrade/job/rolling-upgrade-azure-image-test/22/) [Argus](https://argus.scylladb.com/test/553d2d9e-b170-4ccd-b5c6-44f0673ea2d1/runs?additionalRuns[]=a2350a9d-188d-4cfa-856b-716dea14cf91)