Stress commands were been running about a day, when nemesis AddDropColumn add new column to table with query:
'ALTER TABLE standard1 ADD ( YZYZVOI2RR map<timestamp,varchar>, NEWOPXEEHQ double );'
After that, while new schema was synced between nodes, node5 start reporting a lot of errors and warnings:
< t:2021-07-23 23:29:09,555 f:base.py l:222 c:RemoteCmdRunner p:DEBUG > com.datastax.driver.core.exceptions.ReadTimeoutException: Cassandra timeout during read query at consistency QUORUM (2 responses were required but only 1 replica responded)
< t:2021-07-23 23:29:09,559 f:base.py l:222 c:RemoteCmdRunner p:DEBUG > java.io.IOException: Operation x10 on key(s) [373138504d354c313130]: Error executing: (ReadTimeoutException): Cassandra timeout during read query at consistency QUORUM (2 responses were required but only 1 replica responded)
< t:2021-07-23 23:29:09,559 f:base.py l:222 c:RemoteCmdRunner p:DEBUG > com.datastax.driver.core.exceptions.WriteFailureException: Cassandra failure during write query at consistency QUORUM (2 responses were required but only 1 replica responded, 1 failed)
Installation details Scylla version (or git commit hash): 4.6.dev-0.20210720.dcd05f77b with build-id 544ca11fed80ea34a94eaf292290c0920faec3d0 Cluster size: 6 nodes OS (RHEL/CentOS/Ubuntu/AWS AMI): ami-090bdceda0999b246(eu-north-1)
Test: longevity-50gb-3days-test Test config file: longevity-50GB-3days-authorization-and-tls-ssl.yaml
Issue description Next c-s commands are used to generate dataset and workload during test: prepare_write_cmd:
stress_cmd:
Stress commands were been running about a day, when nemesis AddDropColumn add new column to table with query:
'ALTER TABLE standard1 ADD ( YZYZVOI2RR map<timestamp,varchar>, NEWOPXEEHQ double );'
After that, while new schema was synced between nodes, node5 start reporting a lot of errors and warnings:
And c-s read command got next error:
After that c-s command terminated.
Live monitor: http://13.53.214.97:3000/d/m2j9TKZnk/longevity-50gb-3days-scylla-per-server-metrics-nemesis-master?orgId=1&from=1626985117320&to=1627283091954&var-by=instance&var-cluster=&var-dc=All&var-node=All&var-shard=All&var-sct_tags=DisruptionEvent&var-sct_tags=CoreDumpEvent
Nodes:
Restore Monitor Stack command:
$ hydra investigate show-monitor 1703d316-469f-477b-91ef-dcc3cd3268d7
Show all stored logs command:$ hydra investigate show-logs 1703d316-469f-477b-91ef-dcc3cd3268d7
Test id:
1703d316-469f-477b-91ef-dcc3cd3268d7
Logs: grafana - https://cloudius-jenkins-test.s3.amazonaws.com/1703d316-469f-477b-91ef-dcc3cd3268d7/20210723_233850/grafana-screenshot-longevity-50gb-3days-scylla-per-server-metrics-nemesis-20210723_234143-longevity-tls-50gb-3d-master-monitor-node-1703d316-1.png grafana - https://cloudius-jenkins-test.s3.amazonaws.com/1703d316-469f-477b-91ef-dcc3cd3268d7/20210723_233850/grafana-screenshot-overview-20210723_233850-longevity-tls-50gb-3d-master-monitor-node-1703d316-1.png db-cluster - https://cloudius-jenkins-test.s3.amazonaws.com/1703d316-469f-477b-91ef-dcc3cd3268d7/20210726_042434/db-cluster-1703d316.tar.gz loader-set - https://cloudius-jenkins-test.s3.amazonaws.com/1703d316-469f-477b-91ef-dcc3cd3268d7/20210726_042434/loader-set-1703d316.tar.gz
Jenkins job URL