Hello.
We have cluster of three node: node1, node2, node3.
Let's say, that statsd metrics are sent only to node1.
When we turn off server node2, node1 start to send to carbon more rarely - one time per 1-2 minutes.
When i commented option "nodes" in block "[network]" on node1, all started to be ok.
I think, problem near tcp timeouts when node1 try to communicate with node2 for exchange (or aggregation) metrics.
node1, node2 and node3 in different networks.
So, tcp timeouts not limited by arp request timeouts.
Here screenshot from graphite.
Blue dots - it's all that's left from line.
First time (11:50 - 12:05) - i try to understand, what happening.
Second (12:11 - 12:17) - i commented "nodes" and trying to test the idea
Hello. We have cluster of three node: node1, node2, node3. Let's say, that statsd metrics are sent only to node1. When we turn off server node2, node1 start to send to carbon more rarely - one time per 1-2 minutes. When i commented option "nodes" in block "[network]" on node1, all started to be ok. I think, problem near tcp timeouts when node1 try to communicate with node2 for exchange (or aggregation) metrics. node1, node2 and node3 in different networks. So, tcp timeouts not limited by arp request timeouts.
Here screenshot from graphite. Blue dots - it's all that's left from line. First time (11:50 - 12:05) - i try to understand, what happening. Second (12:11 - 12:17) - i commented "nodes" and trying to test the idea
Config: