Open Ben-M opened 8 years ago
I investigated this a little further by comparing the broker metadata when Poseidon can produce and when it can't. It cannot produce when every partition in a topic has at least one replica offline. It can produce (to any partition) as long as at least one partition has all of its replicas online. This also means that if you create a topic with a replication factor of 5 on a 5-broker cluster, Poseidon cannot produce whenever any single broker is down.
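To make the condition concrete, here is a small Python sketch of the behaviour as I observed it. It is not Poseidon's code, only a model of the rule described above. The function name `can_produce` and the replica assignment are made up for illustration; the real assignment is whatever Kafka chose when the topic was created.

```python
from itertools import combinations


def can_produce(replicas_by_partition, offline_brokers):
    """Model of the observed behaviour (not Poseidon's actual logic):
    producing works only if at least one partition has all replicas online."""
    offline = set(offline_brokers)
    return any(
        offline.isdisjoint(replicas)
        for replicas in replicas_by_partition.values()
    )


# Hypothetical assignment for 3 partitions, replication factor 3, brokers 1-5.
assignment = {0: [1, 2, 3], 1: [2, 3, 4], 2: [3, 4, 5]}

# Try every way of taking two of the five brokers down.
for down in combinations(range(1, 6), 2):
    status = "can produce" if can_produce(assignment, down) else "cannot produce"
    print(down, status)
```

With this made-up assignment, some pairs of offline brokers leave one partition fully replicated and others do not, which would match the mixed results I saw.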
I was experimenting with different failure conditions and found that if I shut down two of five Kafka brokers, Poseidon would often fail to produce and instead respond with "Failed to send all messages". More details:
bin/kafka-topics.sh --create --zookeeper localhost:2181 --replication-factor 3 --partitions 3 --topic 3
With some combinations of brokers down I was able to produce; with other combinations I was not.