Open yaronkaikov opened 1 year ago
/cc @Annamikhlin
how those are connected to each ? can you explain ? how scylla bug makes SCT to not create results file ?
SCT code shouldn't be failing in reporting results, we should fix the fallback to have something like N/A, but still send out of the email and report to argus
< t:2022-11-14 11:45:19,120 f:events_processes.py l:147 c:sdcm.sct_events.events_processes p:DEBUG > Get process `EVENTS_FILE_LOGGER' from EventsProcessesRegistry[lod_dir=/home/jenkins/slave/workspace/scylla-5.0/artifacts-offline-install/artifacts-ubuntu2204-nonroot-test/scylla-cluster-tests/20221114-113226-342996,id=0x7f712738ab00,default=True]
< t:2022-11-14 11:45:19,120 f:remote_base.py l:520 c:RemoteLibSSH2CmdRunner p:DEBUG > Running command "/home/scylla-test/scylladb/bin/scylla --version"...
< t:2022-11-14 11:45:19,623 f:base.py l:228 c:RemoteLibSSH2CmdRunner p:DEBUG > FATAL: Exception during startup, aborting: std::runtime_error (Could not setup Async I/O: Resource temporarily unavailable. The most common cause is not enough request capacity in /proc/sys/fs/aio-max-nr. Try increasing that number or reducing the amount of logical CPUs available for your application)
< t:2022-11-14 11:45:19,623 f:cluster.py l:2092 c:sdcm.cluster_gce p:DEBUG > Node artifacts-ubuntu2204-jenkins-db-node-25f46237-0-1 [34.73.109.90 | 10.142.0.123] (seed: True): Unable to get ScyllaDB version using `/home/scylla-test/scylladb/bin/scylla --version':
< t:2022-11-14 11:45:19,623 f:cluster.py l:2092 c:sdcm.cluster_gce p:DEBUG >
< t:2022-11-14 11:45:19,623 f:cluster.py l:2092 c:sdcm.cluster_gce p:DEBUG > FATAL: Exception during startup, aborting: std::runtime_error (Could not setup Async I/O: Resource temporarily unavailable. The most common cause is not enough request capacity in /proc/sys/fs/aio-max-nr. Try increasing that number or reducing the amount of logical CPUs available for your application)
< t:2022-11-14 11:45:19,623 f:remote_base.py l:520 c:RemoteLibSSH2CmdRunner p:DEBUG > Running command "dpkg-query --show --showformat '${Version}' scylla"...
< t:2022-11-14 11:45:20,166 f:base.py l:228 c:RemoteLibSSH2CmdRunner p:DEBUG > dpkg-query: no packages found matching scylla
< t:2022-11-14 11:45:20,167 f:cluster.py l:2104 c:sdcm.cluster_gce p:DEBUG > Node artifacts-ubuntu2204-jenkins-db-node-25f46237-0-1 [34.73.109.90 | 10.142.0.123] (seed: True): Unable to get ScyllaDB version using `dpkg-query --show --showformat '${Version}' scylla':
< t:2022-11-14 11:45:20,167 f:cluster.py l:2104 c:sdcm.cluster_gce p:DEBUG >
< t:2022-11-14 11:45:20,167 f:cluster.py l:2104 c:sdcm.cluster_gce p:DEBUG > dpkg-query: no packages found matching scylla
< t:2022-11-14 11:45:20,167 f:cluster.py l:2114 c:sdcm.cluster_gce p:WARNING > Node artifacts-ubuntu2204-jenkins-db-node-25f46237-0-1 [34.73.109.90 | 10.142.0.123] (seed: True): All attempts to get ScyllaDB version failed. Looks like there is no ScyllaDB installed.
< t:2022-11-14 11:45:20,168 f:events_processes.py l:147 c:sdcm.sct_events.events_processes p:DEBUG > Get process `EVENTS_FILE_LOGGER' from EventsProcessesRegistry[lod_dir=/home/jenkins/slave/workspace/scylla-5.0/artifacts-offline-install/artifacts-ubuntu2204-nonroot-test/scylla-cluster-tests/20221114-113226-342996,id=0x7f712738ab00,default=True]
< t:2022-11-14 11:45:20,168 f:events_processes.py l:147 c:sdcm.sct_events.events_processes p:DEBUG > Get process `EVENTS_FILE_LOGGER' from EventsProcessesRegistry[lod_dir=/home/jenkins/slave/workspace/scylla-5.0/artifacts-offline-install/artifacts-ubuntu2204-nonroot-test/scylla-cluster-tests/20221114-113226-342996,id=0x7f712738ab00,default=True]
< t:2022-11-14 11:45:20,169 f:events_processes.py l:147 c:sdcm.sct_events.events_processes p:DEBUG > Get process `EVENTS_FILE_LOGGER' from EventsProcessesRegistry[lod_dir=/home/jenkins/slave/workspace/scylla-5.0/artifacts-offline-install/artifacts-ubuntu2204-nonroot-test/scylla-cluster-tests/20221114-113226-342996,id=0x7f712738ab00,default=True]
< t:2022-11-14 11:45:20,169 f:tester.py l:3263 c:ArtifactsTest p:ERROR > Error while saving email data. Error: argument of type 'NoneType' is not iterable
@fruch , shoud #5417 fix this issue now?
@fruch , shoud #5417 fix this issue now?
I don't think so...
Why do we have empty results file?
In this specific case it was cause we failed to get the scylla version
we are trying to get via scylla --version
and fallback to using yum/apt
In this one, it's installing from relocate package, and both methods failed, which in turn fails the creation of the email results file (and also the argus reporting, before Alex fixed it)
All our reporting code is a bit fragile, and any failing in getting some part of info, shouldn't fail the whole report, we should publish an event with the error so it would be noticed, but still carry on and send the results we do have.
Seen both in 5.0 and 2022.1 (maybe some missing backport:?)
https://jenkins.scylladb.com/job/scylla-5.0/job/artifacts-offline-install/job/artifacts-ubuntu2204-nonroot-test/ is failing for almost 2 month on sending emails. this actually fail the entire pipeline Jenkins job