Open marcin-krystianc opened 4 months ago
cluster-log | grep -i "topic126-8"
-> topic126-8.txt
cluster-log | tail -n 50000
-> tail.txt
Rolling restart log:
.\rolling-restart.ps1
2024-06-17 17:25:24 Stopping container: kafka-1
2024-06-17 17:25:24 Waiting for container to stop...
2024-06-17 17:26:16 Container has stopped.
2024-06-17 17:26:16 Starting container: kafka-1
kafka-1
2024-06-17 17:26:17 Container started. Waiting for cluster to catch-up
2024-06-17 17:26:45 Waiting 30s for cluster to rebalance
2024-06-17 17:27:15 Stopping container: kafka-2
2024-06-17 17:27:15 Waiting for container to stop...
2024-06-17 17:27:50 Container has stopped.
2024-06-17 17:27:50 Starting container: kafka-2
kafka-2
2024-06-17 17:27:51 Container started. Waiting for cluster to catch-up
2024-06-17 17:28:00 Waiting 30s for cluster to rebalance
2024-06-17 17:28:30 Stopping container: kafka-3
2024-06-17 17:28:30 Waiting for container to stop...
2024-06-17 17:28:33 Container has stopped.
2024-06-17 17:28:33 Starting container: kafka-3
kafka-3
2024-06-17 17:28:33 Container started. Waiting for cluster to catch-up
2024-06-17 17:28:46 Waiting 30s for cluster to rebalance
2024-06-17 17:29:16 Stopping container: kafka-1
2024-06-17 17:29:16 Waiting for container to stop...
2024-06-17 17:29:56 Container has stopped.
2024-06-17 17:29:56 Starting container: kafka-1
kafka-1
2024-06-17 17:29:57 Container started. Waiting for cluster to catch-up
2024-06-17 17:30:13 Waiting 30s for cluster to rebalance
2024-06-17 17:30:43 Stopping container: kafka-2
2024-06-17 17:30:43 Waiting for container to stop...
2024-06-17 17:31:15 Container has stopped.
2024-06-17 17:31:15 Starting container: kafka-2
kafka-2
2024-06-17 17:31:15 Container started. Waiting for cluster to catch-up
2024-06-17 17:31:26 Waiting 30s for cluster to rebalance
2024-06-17 17:31:56 Stopping container: kafka-3
2024-06-17 17:31:56 Waiting for container to stop...
2024-06-17 17:32:25 Container has stopped.
2024-06-17 17:32:25 Starting container: kafka-3
kafka-3
2024-06-17 17:32:25 Container started. Waiting for cluster to catch-up
2024-06-17 17:32:35 Waiting 30s for cluster to rebalance
2024-06-17 17:33:05 Stopping container: kafka-1
2024-06-17 17:33:05 Waiting for container to stop...
2024-06-17 17:33:39 Container has stopped.
2024-06-17 17:33:39 Starting container: kafka-1
kafka-1
2024-06-17 17:33:40 Container started. Waiting for cluster to catch-up
2024-06-17 17:34:04 Waiting 30s for cluster to rebalance
2024-06-17 17:34:34 Stopping container: kafka-2
2024-06-17 17:34:34 Waiting for container to stop...
2024-06-17 17:35:16 Container has stopped.
2024-06-17 17:35:16 Starting container: kafka-2
kafka-2
2024-06-17 17:35:16 Container started. Waiting for cluster to catch-up
2024-06-17 17:35:27 Waiting 30s for cluster to rebalance
2024-06-17 17:35:57 Stopping container: kafka-3
2024-06-17 17:35:58 Waiting for container to stop...
2024-06-17 17:36:35 Container has stopped.
2024-06-17 17:36:35 Starting container: kafka-3
kafka-3
2024-06-17 17:36:35 Container started. Waiting for cluster to catch-up
2024-06-17 17:36:45 Waiting 30s for cluster to rebalance
2024-06-17 17:37:15 Stopping container: kafka-1
2024-06-17 17:37:15 Waiting for container to stop...
2024-06-17 17:37:54 Container has stopped.
2024-06-17 17:37:54 Starting container: kafka-1
kafka-1
2024-06-17 17:37:55 Container started. Waiting for cluster to catch-up
2024-06-17 17:38:11 Waiting 30s for cluster to rebalance
2024-06-17 17:38:41 Stopping container: kafka-2
2024-06-17 17:38:41 Waiting for container to stop...
2024-06-17 17:39:06 Container has stopped.
2024-06-17 17:39:06 Starting container: kafka-2
kafka-2
2024-06-17 17:39:06 Container started. Waiting for cluster to catch-up
2024-06-17 17:39:16 Waiting 30s for cluster to rebalance
2024-06-17 17:39:46 Stopping container: kafka-3
2024-06-17 17:39:47 Waiting for container to stop...
2024-06-17 17:40:08 Container has stopped.
2024-06-17 17:40:08 Starting container: kafka-3
kafka-3
2024-06-17 17:40:09 Container started. Waiting for cluster to catch-up
2024-06-17 17:40:20 Waiting 30s for cluster to rebalance
2024-06-17 17:40:50 Stopping container: kafka-1
2024-06-17 17:40:50 Waiting for container to stop...
2024-06-17 17:41:17 Container has stopped.
2024-06-17 17:41:17 Starting container: kafka-1
kafka-1
2024-06-17 17:41:18 Container started. Waiting for cluster to catch-up
2024-06-17 17:41:34 Waiting 30s for cluster to rebalance
2024-06-17 17:42:04 Stopping container: kafka-2
2024-06-17 17:42:05 Waiting for container to stop...
2024-06-17 17:42:29 Container has stopped.
2024-06-17 17:42:29 Starting container: kafka-2
kafka-2
2024-06-17 17:42:29 Container started. Waiting for cluster to catch-up
2024-06-17 17:42:40 Waiting 30s for cluster to rebalance
2024-06-17 17:43:10 Stopping container: kafka-3
2024-06-17 17:43:10 Waiting for container to stop...
2024-06-17 17:43:34 Container has stopped.
2024-06-17 17:43:34 Starting container: kafka-3
kafka-3
2024-06-17 17:43:34 Container started. Waiting for cluster to catch-up
2024-06-17 17:43:45 Waiting 30s for cluster to rebalance
2024-06-17 17:44:15 Stopping container: kafka-1
2024-06-17 17:44:15 Waiting for container to stop...
2024-06-17 17:44:51 Container has stopped.
2024-06-17 17:44:51 Starting container: kafka-1
kafka-1
2024-06-17 17:44:51 Container started. Waiting for cluster to catch-up
2024-06-17 17:45:24 Waiting 30s for cluster to rebalance
2024-06-17 17:45:54 Stopping container: kafka-2
2024-06-17 17:45:54 Waiting for container to stop...
2024-06-17 17:46:27 Container has stopped.
2024-06-17 17:46:27 Starting container: kafka-2
kafka-2
2024-06-17 17:46:27 Container started. Waiting for cluster to catch-up
2024-06-17 17:46:37 Waiting 30s for cluster to rebalance
2024-06-17 17:47:07 Stopping container: kafka-3
2024-06-17 17:47:08 Waiting for container to stop...
2024-06-17 17:47:48 Container has stopped.
2024-06-17 17:47:48 Starting container: kafka-3
kafka-3
2024-06-17 17:47:48 Container started. Waiting for cluster to catch-up
2024-06-17 17:48:06 Waiting 30s for cluster to rebalance
2024-06-17 17:48:36 Stopping container: kafka-1
2024-06-17 17:48:36 Waiting for container to stop...
2024-06-17 17:49:27 Container has stopped.
2024-06-17 17:49:27 Starting container: kafka-1
kafka-1
2024-06-17 17:49:28 Container started. Waiting for cluster to catch-up
2024-06-17 17:50:06 Waiting 30s for cluster to rebalance
2024-06-17 17:50:36 Stopping container: kafka-2
2024-06-17 17:50:36 Waiting for container to stop...
2024-06-17 17:51:13 Container has stopped.
2024-06-17 17:51:13 Starting container: kafka-2
kafka-2
2024-06-17 17:51:13 Container started. Waiting for cluster to catch-up
2024-06-17 17:51:30 Waiting 30s for cluster to rebalance
2024-06-17 17:52:00 Stopping container: kafka-3
2024-06-17 17:52:00 Waiting for container to stop...
2024-06-17 17:52:41 Container has stopped.
2024-06-17 17:52:41 Starting container: kafka-3
kafka-3
2024-06-17 17:52:41 Container started. Waiting for cluster to catch-up
2024-06-17 17:53:04 Waiting 30s for cluster to rebalance
2024-06-17 17:53:34 Stopping container: kafka-1
2024-06-17 17:53:34 Waiting for container to stop...
2024-06-17 17:54:07 Container has stopped.
2024-06-17 17:54:07 Starting container: kafka-1
kafka-1
2024-06-17 17:54:08 Container started. Waiting for cluster to catch-up
2024-06-17 17:54:33 Waiting 30s for cluster to rebalance
2024-06-17 17:55:03 Stopping container: kafka-2
2024-06-17 17:55:03 Waiting for container to stop...
2024-06-17 17:56:01 Container has stopped.
2024-06-17 17:56:01 Starting container: kafka-2
I think this is relevant ('topic126-8.txt'):
Line 73: kafka-1 | [2024-06-17 15:30:11,495] WARN [ReplicaFetcher replicaId=1, leaderId=3, fetcherId=0] Partition topic126-8 marked as failed (kafka.server.ReplicaFetcherThread)
Line 382: kafka-3 | [2024-06-17 15:33:05,922] WARN [ReplicaFetcher replicaId=3, leaderId=1, fetcherId=0] Partition topic126-8 marked as failed (kafka.server.ReplicaFetcherThread)
Line 755: kafka-3 | [2024-06-17 15:37:15,831] WARN [ReplicaFetcher replicaId=3, leaderId=1, fetcherId=0] Partition topic126-8 marked as failed (kafka.server.ReplicaFetcherThread)
Line 1083: kafka-3 | [2024-06-17 15:40:50,646] WARN [ReplicaFetcher replicaId=3, leaderId=1, fetcherId=0] Partition topic126-8 marked as failed (kafka.server.ReplicaFetcherThread)
Line 1461: kafka-3 | [2024-06-17 15:44:16,388] WARN [ReplicaFetcher replicaId=3, leaderId=1, fetcherId=0] Partition topic126-8 marked as failed (kafka.server.ReplicaFetcherThread)
Line 1491: kafka-1 | [2024-06-17 15:45:17,802] WARN [ReplicaFetcher replicaId=1, leaderId=3, fetcherId=0] Partition topic126-8 marked as failed (kafka.server.ReplicaFetcherThread)
Line 1996: kafka-3 | [2024-06-17 15:48:37,050] WARN [ReplicaFetcher replicaId=3, leaderId=1, fetcherId=0] Partition topic126-8 marked as failed (kafka.server.ReplicaFetcherThread)
Line 2170: kafka-1 | [2024-06-17 15:50:20,241] WARN [ReplicaFetcher replicaId=1, leaderId=3, fetcherId=0] Partition topic126-8 marked as failed (kafka.server.ReplicaFetcherThread)
Line 3252: kafka-3 | [2024-06-17 15:53:35,076] WARN [ReplicaFetcher replicaId=3, leaderId=1, fetcherId=0] Partition topic126-8 marked as failed (kafka.server.ReplicaFetcherThread)
Line 3332: kafka-1 | [2024-06-17 15:54:33,799] WARN [ReplicaFetcher replicaId=1, leaderId=3, fetcherId=0] Partition topic126-8 marked as failed (kafka.server.ReplicaFetcherThread)