Open nicholaspearson opened 4 years ago
A few notes.
Same problem here using 2 Raspberry Pi nodes (both are manager nodes).
Any response from repo maintainers on this?
Same issue using a Raspberry Pi 3 Model B+. Any solutions?
I ran into the same problem on a Raspberry Pi Model 3. Has anyone found a workaround yet?
Edit: If there is any information you need to debug, let me know; I am willing to help.
Same problem on an ASUSTOR (busybox) install.
I have the same error message in syslog, running Docker 18.9.1 on armv7l (RPi 4 in 32-bit mode) in swarm mode with only one master node. However, my containers start normally and I can also access them from the outside network over the bridge. But I wonder where br0 comes from: my bridge runs under /dev/docker_gwbridge, not br0. Is that the same for you guys? Other than that, I can confirm @nicholaspearson's observation: the directory (and the device) do not exist, hence the error message. I am using overlays as well.
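For anyone who wants to compare notes on which bridge devices actually exist on their node, here is a minimal Python sketch. It assumes a Linux host where network interfaces are listed under /sys/class/net; the function name `bridge_status` and its parameters are made up for illustration. It only looks at the host's own network namespace, so a br0 that lives inside a separate namespace would not show up here.

```python
import os


def bridge_status(names=("br0", "docker_gwbridge"), sysfs_net="/sys/class/net"):
    """Report whether each named interface is visible under sysfs_net.

    Assumption: on Linux, every interface in the current network
    namespace appears as an entry in /sys/class/net.
    """
    try:
        present = set(os.listdir(sysfs_net))
    except FileNotFoundError:
        # No sysfs network directory at all (e.g. not a Linux host).
        present = set()
    return {name: name in present for name in names}
```

Calling `bridge_status()` on an affected node and on a healthy one would show whether they differ in which of the two devices is visible to the host.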
Same issue here. I have a four-node Raspberry Pi 4 (4 GB, ARM64) cluster with one master node, running 64-bit Raspberry Pi OS (Debian-based). I'm seeing this behavior on worker node-1 and node-3. I set up Loki on the cluster to aggregate the various logs and found that when this happens, the nodes seem to keep running but lose all inbound and outbound network connectivity. I lose SSH access and everything else, but the node seems to stay active, just in a disconnected state.
Here are some logs that are possibly related:
2020-12-22 21:47:03 | Dec 23 03:17:34 swarm-node-3 dockerd[526]: time="2020-12-23T03:17:34.317368109Z" level=error msg="failed adding service binding for 55f332a13546a734f6f69fad8a6f3b7d462a076e676a9839d55141ac81363cb5 epRec:{cluster-monitoring_grafana.1.si6tnkplvmbnk14jf7p4v4fdp cluster-monitoring_grafana ld6u2u6094euz0ppgzq33p829 172.20.5.2 172.20.5.3 [] [grafana] [493f5823c508] false} err:network r0xt6yl6vfqw4fgs45a7ch7p0 not found"
2020-12-22 21:47:03 | Dec 23 03:17:34 swarm-node-3 dockerd[526]: time="2020-12-23T03:17:34.313926739Z" level=error msg="failed adding service binding for 51cc84fc5efe2f2a7eba9ebab2638971c987dbcee7e194ab97b3826e1a1459ab epRec:{cluster-monitoring_prometheus.1.x8s1rjddtzpbca97mo6qxfcbo cluster-monitoring_prometheus iibue4vaw0r2qxcfajzpbxfgz 172.20.5.22 172.20.5.23 [] [prometheus] [516f7b187774] false} err:network r0xt6yl6vfqw4fgs45a7ch7p0 not found"
2020-12-22 21:47:03 | Dec 23 03:17:34 swarm-node-3 dockerd[526]: time="2020-12-23T03:17:34.312982424Z" level=error msg="failed adding service binding for 384f771627c9de529758ab8374a3b265a5013de01de10b5b6ef714b1c413a479 epRec:{cluster-monitoring_node-exporter.rrzqexbfbe3rd900u86az51i3.dr9fach7qdlu7j5fm7m5pvdiq cluster-monitoring_node-exporter dfrudo8og7rzvuzlih0f3wx2a 172.20.5.17 172.20.5.21 [] [node-exporter] [4c97770fd8b5] false} err:network r0xt6yl6vfqw4fgs45a7ch7p0 not found"
2020-12-22 21:47:03 | Dec 23 03:17:34 swarm-node-3 dockerd[526]: time="2020-12-23T03:17:34.311953776Z" level=error msg="failed adding service binding for 2f9a80189de15f120549463a607ff5f848ef0bad4c1a977f5cf040d9ccd98b7b epRec:{cluster-monitoring_promtail.rrzqexbfbe3rd900u86az51i3.u0742bms5u85p26wyd1bxuzv4 cluster-monitoring_promtail 8zwobb0zawd7ekinnn60u9d9d 172.20.5.7 172.20.5.10 [] [promtail] [77eefb6b207b] false} err:network r0xt6yl6vfqw4fgs45a7ch7p0 not found"
2020-12-22 21:47:03 | Dec 23 03:17:34 swarm-node-3 dockerd[526]: time="2020-12-23T03:17:34.308033794Z" level=error msg="failed to get network during CreateEndpoint: network r0xt6yl6vfqw4fgs45a7ch7p0 not found"
2020-12-22 21:47:03 | Dec 23 03:17:34 swarm-node-3 dockerd[526]: time="2020-12-23T03:17:34.306874943Z" level=error msg="failed to get network during CreateEndpoint: network r0xt6yl6vfqw4fgs45a7ch7p0 not found"
2020-12-22 21:47:03 | Dec 23 03:17:34 swarm-node-3 dockerd[526]: time="2020-12-23T03:17:34.306136906Z" level=error msg="failed to get network during CreateEndpoint: network r0xt6yl6vfqw4fgs45a7ch7p0 not found"
2020-12-22 21:47:03 | Dec 23 03:17:34 swarm-node-3 dockerd[526]: time="2020-12-23T03:17:34.305671702Z" level=error msg="network cluster-monitoring_monitoring remove failed: error while removing network: unknown network cluster-monitoring_monitoring id r0xt6yl6vfqw4fgs45a7ch7p0" module=node/agent node.id=mpuf1oxq1pwnanun9pwhjeo7t
2020-12-22 21:47:03 | Dec 23 03:17:34 swarm-node-3 dockerd[526]: time="2020-12-23T03:17:34.305231609Z" level=error msg="failed to get network during CreateEndpoint: network r0xt6yl6vfqw4fgs45a7ch7p0 not found"
2020-12-22 21:47:03 | Dec 23 03:17:34 swarm-node-3 dockerd[526]: time="2020-12-23T03:17:34.304561035Z" level=error msg="network cluster-monitoring_monitoring remove failed: error while removing network: unknown network cluster-monitoring_monitoring id r0xt6yl6vfqw4fgs45a7ch7p0" module=node/agent node.id=mpuf1oxq1pwnanun9pwhjeo7t
2020-12-22 21:47:03 | Dec 23 03:17:34 swarm-node-3 dockerd[526]: time="2020-12-23T03:17:34.292442757Z" level=error msg="network cluster-monitoring_monitoring remove failed: error while removing network: unknown network cluster-monitoring_monitoring id r0xt6yl6vfqw4fgs45a7ch7p0" module=node/agent node.id=mpuf1oxq1pwnanun9pwhjeo7t
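To make log dumps like the one above easier to compare across nodes, here is a rough Python sketch that buckets dockerd error lines by kind and counts which network IDs they mention. The bucket names and the `summarize` function are my own invention, and the ID pattern assumes swarm network IDs are 25 lowercase alphanumeric characters, as they appear in these logs.

```python
import re
from collections import Counter

# level=... msg="..." as emitted by dockerd in the lines above.
MSG_RE = re.compile(r'level=(\w+) msg="((?:[^"\\]|\\.)*)"')
# Assumption: a network ID is 25 chars of [a-z0-9], following "network " or "id ".
NET_ID_RE = re.compile(r'\b(?:network|id) ([a-z0-9]{25})\b')


def summarize(lines):
    """Return (kinds, networks): Counters of error kinds and network IDs."""
    kinds = Counter()
    networks = Counter()
    for line in lines:
        m = MSG_RE.search(line)
        if not m or m.group(1) != "error":
            continue  # skip non-dockerd lines and non-error levels
        msg = m.group(2)
        if msg.startswith("failed adding service binding"):
            kinds["service binding"] += 1
        elif msg.startswith("failed to get network during CreateEndpoint"):
            kinds["CreateEndpoint"] += 1
        elif "remove failed" in msg:
            kinds["network remove"] += 1
        else:
            kinds["other"] += 1
        # Count every network ID mentioned inside the message text.
        for net_id in NET_ID_RE.findall(msg):
            networks[net_id] += 1
    return kinds, networks
```

Feeding it the lines of a saved log file (for example `summarize(open(path))`, where `path` is wherever your syslog export lives) gives a quick count per error kind and shows whether all errors point at the same network ID, as they do in the excerpt above.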
A little about the setup: I have a 5 TB RAID 10 drive and a 500 GB SSD connected to the master node. Both drives are shared over NFS with the other 3 nodes, and the 500 GB SSD is used for Docker volume data. I have a few services hosted outside of the swarm, since it doesn't support mounting devices or tunnels, so I simply use a reverse proxy to route traffic through the swarm network.
Just to re-emphasize: I have it set up to stream syslogs, and when this issue hits, the log stream halts because all network traffic just dies. The containers still operate, just without network access. Once I reboot the node, the network resets and the logs are flushed all at once. I have an HDMI-to-USB 3 capture stick on order that arrives tomorrow, so I should be able to use my tablet as a monitor to debug it.
Expected behavior
The container starts when a stack deploy command is run.
Actual behavior
The container does not start, apparently due to the following error found in the journal.
Output of docker version:
Output of docker info:
Additional environment details (AWS, VirtualBox, physical, etc.)
8-node Raspberry Pi Docker Swarm cluster with 2 managers.