redpanda-data / redpanda

CI Failure (cluster_recovery_complete did not occur in time) in `ClusterRecoveryTest.test_basic_controller_snapshot_restore` #19323

Open vbotbuildovich opened 3 weeks ago

vbotbuildovich commented 3 weeks ago

https://buildkite.com/redpanda/redpanda/builds/50072

Module: rptest.tests.cluster_recovery_test
Class: ClusterRecoveryTest
Method: test_basic_controller_snapshot_restore
test_id:    ClusterRecoveryTest.test_basic_controller_snapshot_restore
status:     FAIL
run time:   74.176 seconds

ConnectionError(MaxRetryError("HTTPConnectionPool(host='docker-rp-16', port=9644): Max retries exceeded with url: /v1/cloud_storage/automated_recovery (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f701e2b2920>: Failed to establish a new connection: [Errno 111] Connection refused'))"))
Traceback (most recent call last):
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/urllib3/connection.py", line 159, in _new_conn
    conn = connection.create_connection(
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/urllib3/util/connection.py", line 84, in create_connection
    raise err
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/urllib3/util/connection.py", line 74, in create_connection
    sock.connect(sa)
ConnectionRefusedError: [Errno 111] Connection refused

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/urllib3/connectionpool.py", line 670, in urlopen
    httplib_response = self._make_request(
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/urllib3/connectionpool.py", line 392, in _make_request
    conn.request(method, url, **httplib_request_kw)
  File "/usr/lib/python3.10/http/client.py", line 1283, in request
    self._send_request(method, url, body, headers, encode_chunked)
  File "/usr/lib/python3.10/http/client.py", line 1329, in _send_request
    self.endheaders(body, encode_chunked=encode_chunked)
  File "/usr/lib/python3.10/http/client.py", line 1278, in endheaders
    self._send_output(message_body, encode_chunked=encode_chunked)
  File "/usr/lib/python3.10/http/client.py", line 1038, in _send_output
    self.send(msg)
  File "/usr/lib/python3.10/http/client.py", line 976, in send
    self.connect()
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/urllib3/connection.py", line 187, in connect
    conn = self._new_conn()
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/urllib3/connection.py", line 171, in _new_conn
    raise NewConnectionError(
urllib3.exceptions.NewConnectionError: <urllib3.connection.HTTPConnection object at 0x7f701e2b2920>: Failed to establish a new connection: [Errno 111] Connection refused

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/requests/adapters.py", line 439, in send
    resp = conn.urlopen(
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/urllib3/connectionpool.py", line 726, in urlopen
    retries = retries.increment(
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/urllib3/util/retry.py", line 446, in increment
    raise MaxRetryError(_pool, url, error or ResponseError(cause))
urllib3.exceptions.MaxRetryError: HTTPConnectionPool(host='docker-rp-16', port=9644): Max retries exceeded with url: /v1/cloud_storage/automated_recovery (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f701e2b2920>: Failed to establish a new connection: [Errno 111] Connection refused'))

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/tests/runner_client.py", line 184, in _do_run
    data = self.run_test()
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/tests/runner_client.py", line 276, in run_test
    return self.test_context.function(self.test)
  File "/root/tests/rptest/services/cluster.py", line 105, in wrapped
    r = f(self, *args, **kwargs)
  File "/root/tests/rptest/tests/cluster_recovery_test.py", line 115, in test_basic_controller_snapshot_restore
    wait_until(cluster_recovery_complete, timeout_sec=30, backoff_sec=1)
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/utils/util.py", line 53, in wait_until
    raise e
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/utils/util.py", line 44, in wait_until
    if condition():
  File "/root/tests/rptest/tests/cluster_recovery_test.py", line 112, in cluster_recovery_complete
    return "inactive" in self.redpanda._admin.get_cluster_recovery_status(
  File "/root/tests/rptest/services/admin.py", line 1280, in get_cluster_recovery_status
    return self._request('get', "cloud_storage/automated_recovery",
  File "/root/tests/rptest/services/admin.py", line 546, in _request
    r = self._session.request(verb, url, **kwargs)
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/requests/sessions.py", line 530, in request
    resp = self.send(prep, **send_kwargs)
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/requests/sessions.py", line 643, in send
    r = adapter.send(request, **kwargs)
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/requests/adapters.py", line 516, in send
    raise ConnectionError(e, request=request)
requests.exceptions.ConnectionError: HTTPConnectionPool(host='docker-rp-16', port=9644): Max retries exceeded with url: /v1/cloud_storage/automated_recovery (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f701e2b2920>: Failed to establish a new connection: [Errno 111] Connection refused'))
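
The traceback shows the `ConnectionError` raised inside `cluster_recovery_complete` propagating straight out of `wait_until`, so a single refused connection to the admin port fails the test before the 30-second timeout is exhausted. One possible mitigation, sketched below under stated assumptions, is to wrap the predicate so that a refused connection counts as "not yet" and polling continues. Everything here is hypothetical and self-contained: `tolerate`, the simplified `wait_until` stand-in, and `fake_recovery_status` are illustration names, not rptest or ducktape code. In the real test the tuple of tolerated exceptions would presumably need to include `requests.exceptions.ConnectionError`, which is not a subclass of the built-in `ConnectionError` used here.

```python
import time


def tolerate(condition, transient=(ConnectionError,)):
    """Wrap a polling predicate so the listed errors count as 'not yet'."""
    def wrapped():
        try:
            return condition()
        except transient:
            # Node is likely still restarting; report "not done" and retry.
            return False
    return wrapped


def wait_until(condition, timeout_sec, backoff_sec):
    """Simplified stand-in for a polling helper (assumption: not ducktape's code)."""
    deadline = time.monotonic() + timeout_sec
    while True:
        if condition():
            return
        if time.monotonic() >= deadline:
            raise TimeoutError("condition not met in time")
        time.sleep(backoff_sec)


# Fake admin endpoint: refuses the first two connections, then answers.
attempts = {"n": 0}


def fake_recovery_status():
    attempts["n"] += 1
    if attempts["n"] <= 2:
        raise ConnectionRefusedError(111, "Connection refused")
    return "inactive"


def cluster_recovery_complete():
    return "inactive" in fake_recovery_status()


# Without tolerate(), the first ConnectionRefusedError would abort the wait.
wait_until(tolerate(cluster_recovery_complete), timeout_sec=5, backoff_sec=0.01)
```

Whether swallowing the error is appropriate depends on why the admin port on `docker-rp-16` was refusing connections. If the node is expected to restart during recovery, retrying is reasonable; if it crashed, this would only turn the failure into a timeout.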

JIRA Link: CORE-3707

vbotbuildovich commented 3 weeks ago

https://buildkite.com/redpanda/redpanda/builds/50072