confluentinc / cp-docker-images

[DEPRECATED] Docker images for Confluent Platform.
Apache License 2.0
1.14k stars 703 forks source link

Controller to Broker - Socket Timeout Exception after restarting the VM - Urgent #656

Open jvarma2306 opened 5 years ago

jvarma2306 commented 5 years ago

Hi All,

We are using confluentinc/cp-kafka:3.2.1 for one of out enterprise product. Below is the Broker Configuration.

kafka-1: deploy: placement: constraints:

We are running this container on RHEL 7.4 version docker. When we start the docker and KAFKA, every thing works fine.

We have a usecase where we want to verify the recovery of KAFKA one VM / HOST reboot or power cycle. This case is failing. When we reboot the Host / VM, KAFKA on start is throwing this ERROR continuously.

[2018-12-18 17:14:29,589] WARN [Controller-1-to-broker-1-send-thread], Controller 1's connection to broker :19092 (id: 1 rack: null) was unsuccessful (kafka.controller.RequestSendThread) java.net.SocketTimeoutException: Failed to connect within 30000 ms at kafka.controller.RequestSendThread.brokerReady(ControllerChannelManager.scala:237) at kafka.controller.RequestSendThread.liftedTree1$1(ControllerChannelManager.scala:189) at kafka.controller.RequestSendThread.doWork(ControllerChannelManager.scala:188) at kafka.utils.ShutdownableThread.run(ShutdownableThread.scala:63)

If we un-install and re-install, it works fine. But again reboot of VM this throws this error.

We need the experts help here. Thanks in Advance.

OneCricketeer commented 5 years ago

We have a usecase where we want to verify the recovery of KAFKA one VM / HOST reboot or power cycle

In order to do this, you need more than one Zookeeper and Kafka broker on different machines that will not be shutdown, and you need to have more than one replica for all your topics see REPLICATION_FACTOR: 1 configuration

cobolbaby commented 5 years ago

Is there a solution?