redpanda-data / redpanda

Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!
https://redpanda.com
9.44k stars 580 forks source link

CI Failure (key symptom) in `ConfigProfileVerifyTest.test_config_profile_verify` #21401

Closed vbotbuildovich closed 2 months ago

vbotbuildovich commented 2 months ago

https://buildkite.com/redpanda/vtools/builds/15531 https://buildkite.com/redpanda/vtools/builds/15532

Module: rptest.redpanda_cloud_tests.config_profile_verify_test
Class: ConfigProfileVerifyTest
Method: test_config_profile_verify
test_id:    ConfigProfileVerifyTest.test_config_profile_verify
status:     FAIL
run time:   1940.859 seconds

CalledProcessError(1, ['tsh', 'ssh', '--proxy=proxy.tp.redpanda.com:443', '--auth=okta', '--identity=/tmp/machine-id/identity', 'redpanda@cq892nj80m0kpn268tdg-agent', './breakglass-tools.sh'], '', '\x1b[31mERROR: \x1b[0mfailed connecting to host cq892nj80m0kpn268tdg-agent:0: failed to receive cluster details response\n\tfailed to dial target host\n\tTeleport proxy failed to connect to "node" agent "cq892nj80m0kpn268tdg-agent:3022" over direct dial:\n\n  dial tcp: lookup cq892nj80m0kpn268tdg-agent on 127.0.0.53:53: server misbehaving\n\nThis usually means that the agent is offline or has disconnected. Check the\nagent logs and, if the issue persists, try restarting it or re-registering it\nwith the cluster.\n\n')
Traceback (most recent call last):
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/tests/runner_client.py", line 177, in _do_run
    self.test = self.test_context.cls(self.test_context)
  File "/home/ubuntu/redpanda/tests/rptest/redpanda_cloud_tests/config_profile_verify_test.py", line 31, in __init__
    super().__init__(test_context=test_context)
  File "/home/ubuntu/redpanda/tests/rptest/tests/redpanda_cloud_test.py", line 25, in __init__
    self.redpanda = make_redpanda_cloud_service(test_context)
  File "/home/ubuntu/redpanda/tests/rptest/services/redpanda.py", line 5093, in make_redpanda_cloud_service
    return RedpandaServiceCloud(context,
  File "/home/ubuntu/redpanda/tests/rptest/services/redpanda.py", line 1706, in __init__
    self.rebuild_pods_classes()
  File "/home/ubuntu/redpanda/tests/rptest/services/redpanda.py", line 1734, in rebuild_pods_classes
    for p in self.get_redpanda_pods()
  File "/home/ubuntu/redpanda/tests/rptest/services/redpanda.py", line 1793, in get_redpanda_pods
    pods = json.loads(self.kubectl.cmd('get pods -n redpanda -o json'))
  File "/home/ubuntu/redpanda/tests/rptest/clients/kubectl.py", line 269, in cmd
    self._install()
  File "/home/ubuntu/redpanda/tests/rptest/clients/kubectl.py", line 122, in _install
    self._ssh_cmd(['./breakglass-tools.sh'])
  File "/home/ubuntu/redpanda/tests/rptest/clients/kubectl.py", line 252, in _ssh_cmd
    return self._local_cmd(local_cmd)
  File "/home/ubuntu/redpanda/tests/rptest/clients/kubectl.py", line 232, in _local_cmd
    raise subprocess.CalledProcessError(process.returncode, cmd, s_out,
subprocess.CalledProcessError: Command '['tsh', 'ssh', '--proxy=proxy.tp.redpanda.com:443', '--auth=okta', '--identity=/tmp/machine-id/identity', 'redpanda@cq892nj80m0kpn268tdg-agent', './breakglass-tools.sh']' returned non-zero exit status 1.

JIRA Link: CORE-5609

dotnwat commented 2 months ago

dupe