redpanda-data / redpanda

Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!
https://redpanda.com
9.39k stars 577 forks source link

CI Failure (key symptom) in `TieredStorageIoStressTest.test_io_stress` #22688

Open vbotbuildovich opened 1 month ago

vbotbuildovich commented 1 month ago

https://buildkite.com/redpanda/vtools/builds/16098

Module: rptest.scale_tests.tiered_storage_io_stress_test
Class: TieredStorageIoStressTest
Method: test_io_stress
Arguments: {
    "segment_size": 1048576,
    "interval_uploads": true,
    "cloud_storage_type": 1
}
test_id:    TieredStorageIoStressTest.test_io_stress
status:     FAIL
run time:   478.807 seconds

<BadLogLines nodes=ip-172-31-11-209(16) example="ERROR 2024-08-01 14:21:38,758 [shard 0:fetc] cloud_storage - cache_service.cc:713 - trim: couldn't delete /var/lib/redpanda/data/cloud_storage_cache/af1370a3-acd6-4ec5-901f-1fffd04b2574/kafka/topic-ueicqinxqo/20_24/60-63-2000508-1-v1.log.1_chunks/0: filesystem error: remove failed: No such file or directory ["/var/lib/redpanda/data/cloud_storage_cache/af1370a3-acd6-4ec5-901f-1fffd04b2574/kafka/topic-ueicqinxqo/20_24/60-63-2000508-1-v1.log.1_chunks/0"].">
Traceback (most recent call last):
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/tests/runner_client.py", line 184, in _do_run
    data = self.run_test()
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/tests/runner_client.py", line 276, in run_test
    return self.test_context.function(self.test)
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/mark/_mark.py", line 535, in wrapper
    return functools.partial(f, *args, **kwargs)(*w_args, **w_kwargs)
  File "/home/ubuntu/redpanda/tests/rptest/services/cluster.py", line 190, in wrapped
    redpanda.raise_on_bad_logs(
  File "/home/ubuntu/redpanda/tests/rptest/services/redpanda.py", line 1576, in raise_on_bad_logs
    lsearcher.search_logs(_searchable_nodes)
  File "/home/ubuntu/redpanda/tests/rptest/services/utils.py", line 197, in search_logs
    raise BadLogLines(bad_loglines)
rptest.services.utils.BadLogLines: <BadLogLines nodes=ip-172-31-11-209(16) example="ERROR 2024-08-01 14:21:38,758 [shard 0:fetc] cloud_storage - cache_service.cc:713 - trim: couldn't delete /var/lib/redpanda/data/cloud_storage_cache/af1370a3-acd6-4ec5-901f-1fffd04b2574/kafka/topic-ueicqinxqo/20_24/60-63-2000508-1-v1.log.1_chunks/0: filesystem error: remove failed: No such file or directory ["/var/lib/redpanda/data/cloud_storage_cache/af1370a3-acd6-4ec5-901f-1fffd04b2574/kafka/topic-ueicqinxqo/20_24/60-63-2000508-1-v1.log.1_chunks/0"].">

JIRA Link: CORE-6765

vbotbuildovich commented 1 month ago

*https://buildkite.com/redpanda/vtools/builds/16154

vbotbuildovich commented 1 month ago

*https://buildkite.com/redpanda/vtools/builds/16225

vbotbuildovich commented 1 month ago

*https://buildkite.com/redpanda/vtools/builds/16241

dotnwat commented 1 month ago

probably a harmless race, but good to fix