redpanda-data / redpanda

Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!
https://redpanda.com
9.22k stars 564 forks source link

CI Failure (key symptom) in `TieredStorageIoStressTest.test_io_stress` #19909

Open vbotbuildovich opened 1 month ago

vbotbuildovich commented 1 month ago

https://buildkite.com/redpanda/vtools/builds/14778

Module: rptest.scale_tests.tiered_storage_io_stress_test
Class: TieredStorageIoStressTest
Method: test_io_stress
Arguments: {
    "segment_size": 1048576,
    "interval_uploads": true,
    "cloud_storage_type": 1
}
test_id:    TieredStorageIoStressTest.test_io_stress
status:     FAIL
run time:   472.413 seconds

<BadLogLines nodes=ip-172-31-5-48(4) example="ERROR 2024-06-17 14:12:58,563 [shard 0:fetc] cloud_storage - cache_service.cc:708 - trim: couldn't delete /var/lib/redpanda/data/cloud_storage_cache/e30a96c2/kafka/topic-yhtponiaak/61_25/10-11-2000292-1-v1.log.1_chunks/0: filesystem error: remove failed: No such file or directory ["/var/lib/redpanda/data/cloud_storage_cache/e30a96c2/kafka/topic-yhtponiaak/61_25/10-11-2000292-1-v1.log.1_chunks/0"].">
Traceback (most recent call last):
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/tests/runner_client.py", line 184, in _do_run
    data = self.run_test()
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/tests/runner_client.py", line 276, in run_test
    return self.test_context.function(self.test)
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/mark/_mark.py", line 535, in wrapper
    return functools.partial(f, *args, **kwargs)(*w_args, **w_kwargs)
  File "/home/ubuntu/redpanda/tests/rptest/services/cluster.py", line 190, in wrapped
    redpanda.raise_on_bad_logs(
  File "/home/ubuntu/redpanda/tests/rptest/services/redpanda.py", line 1565, in raise_on_bad_logs
    lsearcher.search_logs(_searchable_nodes)
  File "/home/ubuntu/redpanda/tests/rptest/services/utils.py", line 197, in search_logs
    raise BadLogLines(bad_loglines)
rptest.services.utils.BadLogLines: <BadLogLines nodes=ip-172-31-5-48(4) example="ERROR 2024-06-17 14:12:58,563 [shard 0:fetc] cloud_storage - cache_service.cc:708 - trim: couldn't delete /var/lib/redpanda/data/cloud_storage_cache/e30a96c2/kafka/topic-yhtponiaak/61_25/10-11-2000292-1-v1.log.1_chunks/0: filesystem error: remove failed: No such file or directory ["/var/lib/redpanda/data/cloud_storage_cache/e30a96c2/kafka/topic-yhtponiaak/61_25/10-11-2000292-1-v1.log.1_chunks/0"].">

JIRA Link: CORE-4237

vbotbuildovich commented 2 weeks ago

*https://buildkite.com/redpanda/vtools/builds/15454

vbotbuildovich commented 1 week ago

*https://buildkite.com/redpanda/vtools/builds/15617

vbotbuildovich commented 4 days ago

*https://buildkite.com/redpanda/vtools/builds/15817

vbotbuildovich commented 2 days ago

*https://buildkite.com/redpanda/vtools/builds/15844