Closed NyaliaLui closed 1 year ago
on (arm64, container) in job https://buildkite.com/redpanda/redpanda/builds/29289#01882851-e706-42d6-8c5d-f50a80a86b6c
This is a shutdown issue. A node was stopped shortly before it was asked to decommission itself and it failed:
[INFO - 2023-06-14 02:02:11,738 - failure_injector - inject_failure - lineno:79]: injecting failure: type: 1, length: 0 seconds, node: docker-rp-2
[INFO - 2023-06-14 02:02:11,739 - failure_injector - _terminate - lineno:209]: terminating redpanda on docker-rp-2
...
[INFO - 2023-06-14 02:02:11,786 - node_operations - decommission - lineno:231]: executor - decommissioning node 1 (idx: 2)
[DEBUG - 2023-06-14 02:02:11,789 - admin - _request - lineno:332]: Dispatching put http://docker-rp-2:9644/v1/brokers/1/decommission
[WARNING - 2023-06-14 02:02:11,796 - admin - _request - lineno:350]: Response 500: {"message": "Unexpected error: rpc::errc::missing_node_rpc_client", "code": 500}
Looks like https://github.com/redpanda-data/redpanda/pull/8847 is something that might have caused it?
In any case, low severity, and the fix is to return some kind of "shutting down" error code, instead of a more disturbing rpc::errc::missing_node_rpc_client
.
Another instance:
FAIL test: RandomNodeOperationsTest.test_node_operations.enable_failures=True.num_to_upgrade=0.enable_controller_snapshots=True (1/31 runs)
failure at 2023-06-19T14:48:04.475Z:
https://buildkite.com/redpanda/redpanda/builds/27206#0187809b-cc6a-48fa-b257-9dde43c9df11