ocf / kafka

Experimental Kafka Cluster @ OCF running on Kubernetes
Other
0 stars 0 forks source link

I think the DNS is like, fucked #4

Open Baisang opened 4 years ago

Baisang commented 4 years ago

Cluster can sometimes get into a weird state where pods are no longer discoverable via the k8s service hostnames...

My guess is there is some DNS cache that is not being expired when pods get scheduled on new hosts. I guess existing somewhere on k8s side, since when we restart pods the problem persists (sometimes the pods can't even talk to each other!) This affects ZK too, unfortunately...

Some ideas for how to fix this are

cg505 commented 4 years ago

same