Open alekc opened 5 years ago
Similar issue, providing the logs for comparison:
systemd[1]: Starting Docker Application Container Engine...
dockerd[7932]: time="2020-04-03T01:17:00.994257494+02:00" level=info msg="Starting up"
dockerd[7932]: time="2020-04-03T01:17:00.995677901+02:00" level=info msg="parsed scheme: \"unix\"" module=grpc
dockerd[7932]: time="2020-04-03T01:17:00.996081387+02:00" level=info msg="scheme \"unix\" not registered, fallback to default scheme" module=grpc
dockerd[7932]: time="2020-04-03T01:17:00.996428662+02:00" level=info msg="ccResolverWrapper: sending update to cc: {[{unix:///run/containerd/containerd.sock 0 <nil>}] <nil>}" module=grpc
dockerd[7932]: time="2020-04-03T01:17:00.996720686+02:00" level=info msg="ClientConn switching balancer to \"pick_first\"" module=grpc
dockerd[7932]: time="2020-04-03T01:17:00.998190806+02:00" level=info msg="parsed scheme: \"unix\"" module=grpc
dockerd[7932]: time="2020-04-03T01:17:00.998210353+02:00" level=info msg="scheme \"unix\" not registered, fallback to default scheme" module=grpc
dockerd[7932]: time="2020-04-03T01:17:00.998228833+02:00" level=info msg="ccResolverWrapper: sending update to cc: {[{unix:///run/containerd/containerd.sock 0 <nil>}] <nil>}" module=grpc
dockerd[7932]: time="2020-04-03T01:17:00.998238866+02:00" level=info msg="ClientConn switching balancer to \"pick_first\"" module=grpc
dockerd[7932]: time="2020-04-03T01:17:01.013037586+02:00" level=info msg="[graphdriver] using prior storage driver: overlay2"
dockerd[7932]: time="2020-04-03T01:17:01.035758061+02:00" level=warning msg="Your kernel does not support swap memory limit"
dockerd[7932]: time="2020-04-03T01:17:01.035789780+02:00" level=warning msg="Your kernel does not support cgroup rt period"
dockerd[7932]: time="2020-04-03T01:17:01.035796358+02:00" level=warning msg="Your kernel does not support cgroup rt runtime"
dockerd[7932]: time="2020-04-03T01:17:01.035932766+02:00" level=info msg="Loading containers: start."
dockerd[7932]: time="2020-04-03T01:17:01.548248367+02:00" level=error msg="getNetworkFromStore for nid 446e39b990727a489a27bb95c699551bdbb99c1e0b4e13d3f9f2f34e2ef85ae3 failed while trying to build sandbox for cleanup: network 446e39b990727a489a27bb95c699551bdbb99c1e0b4e13d3f9f2f34e2ef85ae3 not found"
dockerd[7932]: time="2020-04-03T01:17:01.548291835+02:00" level=info msg="Removing stale sandbox 08ee73a0483cb9cdc3d4fbec3cd759554f3e1865d2baa7aafc94fe2a315c45b5 (1d3637daa9d9dd7d596e4db50a32a02a21c42d4810753055785e36538008ac5a)"
dockerd[7932]: time="2020-04-03T01:17:01.567197747+02:00" level=warning msg="Error (Unable to complete atomic operation, key modified) deleting object [endpoint 2c59d4728bbae09c6f816e5c3d6a89b32c841ef6bdf98f4af99091df28d638d2 8283e268f371afe93ca0dafb5653231f26fbafb250e360cb90b7b20b9d909a05], retrying...."
dockerd[7932]: time="2020-04-03T01:17:01.569913193+02:00" level=warning msg="Failed getting network for ep 6fb6efbdfccb4953878a23713ffb78cb5b4c09dbb39df376d6e2299ae8d06030 during sandbox 08ee73a0483cb9cdc3d4fbec3cd759554f3e1865d2baa7aafc94fe2a315c45b5 delete: network 446e39b990727a489a27bb95c699551bdbb99c1e0b4e13d3f9f2f34e2ef85ae3 not found"
dockerd[7932]: time="2020-04-03T01:17:01.569943216+02:00" level=error msg="Failed to delete sandbox 08ee73a0483cb9cdc3d4fbec3cd759554f3e1865d2baa7aafc94fe2a315c45b5 while trying to cleanup: could not cleanup all the endpoints in container 1d3637daa9d9dd7d596e4db50a32a02a21c42d4810753055785e36538008ac5a / sandbox 08ee73a0483cb9cdc3d4fbec3cd759554f3e1865d2baa7aafc94fe2a315c45b5"
dockerd[7932]: time="2020-04-03T01:17:01.663561528+02:00" level=info msg="Removing stale sandbox 9ddaa33a8223ce77c5c346a75c01e3adfa7b2d6ada339c67efed732a93f30f71 (5227e8bc97c0fd56360d5577de24b3ae724c2e63364f3d8ab16c650c782afef5)"
dockerd[7932]: time="2020-04-03T01:17:01.681488389+02:00" level=info msg="Removing stale endpoint website_db_1 (0f827b20d25afcf3afb68bdf7d2f8b10dc1a3d4219e5f621f53cdf5b290c17be)"
dockerd[7932]: time="2020-04-03T01:17:01.730543201+02:00" level=info msg="Fixing inconsistent endpoint_cnt for network website_default. Expected=0, Actual=1"
dockerd[7932]: time="2020-04-03T01:17:01.758639580+02:00" level=info msg="Default bridge (docker0) is assigned with an IP address 172.17.0.0/16. Daemon option --bip can be used to set a preferred IP address"
dockerd[7932]: time="2020-04-03T01:17:01.800804484+02:00" level=warning msg="Failed getting network for ep 6fb6efbdfccb4953878a23713ffb78cb5b4c09dbb39df376d6e2299ae8d06030 during sandbox 08ee73a0483cb9cdc3d4fbec3cd759554f3e1865d2baa7aafc94fe2a315c45b5 delete: network 446e39b990727a489a27bb95c699551bdbb99c1e0b4e13d3f9f2f34e2ef85ae3 not found"
dockerd[7932]: time="2020-04-03T01:17:01.801431437+02:00" level=error msg="failed to cleanup up stale network sandbox for container 1d3637daa9d9dd7d596e4db50a32a02a21c42d4810753055785e36538008ac5a"
dockerd[7932]: time="2020-04-03T01:17:01.902118731+02:00" level=error msg="stream copy error: reading from a closed fifo"
dockerd[7932]: time="2020-04-03T01:17:01.902203046+02:00" level=error msg="stream copy error: reading from a closed fifo"
dockerd[7932]: panic: runtime error: invalid memory address or nil pointer dereference
dockerd[7932]: [signal SIGSEGV: segmentation violation code=0x1 addr=0x18 pc=0x5647a626be94]
dockerd[7932]: goroutine 391 [running]:
dockerd[7932]: github.com/docker/docker/vendor/github.com/docker/libnetwork.(*endpoint).addServiceInfoToCluster(0xc000c87080, 0xc000c966c0, 0x0, 0x0)
dockerd[7932]: /go/src/github.com/docker/docker/vendor/github.com/docker/libnetwork/agent.go:599 +0xf74
dockerd[7932]: github.com/docker/docker/vendor/github.com/docker/libnetwork.(*sandbox).EnableService(0xc000c966c0, 0x0, 0x0)
dockerd[7932]: /go/src/github.com/docker/docker/vendor/github.com/docker/libnetwork/sandbox.go:702 +0x187
dockerd[7932]: github.com/docker/docker/daemon.(*Daemon).ActivateContainerServiceBinding(0xc00000c3c0, 0xc000aac506, 0x7, 0x0, 0x0)
dockerd[7932]: /go/src/github.com/docker/docker/daemon/container_operations.go:1118 +0x16a
dockerd[7932]: github.com/docker/docker/daemon.(*Daemon).connectToNetwork(0xc00000c3c0, 0xc00077e6c0, 0x5647a6d6e930, 0x6, 0xc000a7c0c0, 0x0, 0x0, 0x0)
dockerd[7932]: /go/src/github.com/docker/docker/daemon/container_operations.go:800 +0xa13
dockerd[7932]: github.com/docker/docker/daemon.(*Daemon).allocateNetwork(0xc00000c3c0, 0xc00077e6c0, 0x0, 0x0)
dockerd[7932]: /go/src/github.com/docker/docker/daemon/container_operations.go:543 +0x9b2
dockerd[7932]: github.com/docker/docker/daemon.(*Daemon).initializeNetworking(0xc00000c3c0, 0xc00077e6c0, 0x0, 0x0)
dockerd[7932]: /go/src/github.com/docker/docker/daemon/container_operations.go:957 +0x9a
dockerd[7932]: github.com/docker/docker/daemon.(*Daemon).containerStart(0xc00000c3c0, 0xc00077e6c0, 0x0, 0x0, 0x0, 0x0, 0x1, 0x0, 0x0)
dockerd[7932]: /go/src/github.com/docker/docker/daemon/start.go:150 +0x310
dockerd[7932]: github.com/docker/docker/daemon.(*Daemon).restore.func5(0xc0000e02d0, 0xc00000c3c0, 0xc000405500, 0xc000560ea0, 0xc00077e6c0, 0xc000c94720)
dockerd[7932]: /go/src/github.com/docker/docker/daemon/daemon.go:501 +0x2c9
dockerd[7932]: created by github.com/docker/docker/daemon.(*Daemon).restore
dockerd[7932]: /go/src/github.com/docker/docker/daemon/daemon.go:482 +0x744
systemd[1]: docker.service: Main process exited, code=exited, status=2/INVALIDARGUMENT
systemd[1]: Failed to start Docker Application Container Engine.
systemd[1]: docker.service: Unit entered failed state.
systemd[1]: docker.service: Failed with result 'exit-code'.
systemd[1]: docker.service: Service hold-off time over, scheduling restart.
systemd[1]: Stopped Docker Application Container Engine.
systemd[1]: docker.service: Start request repeated too quickly.
systemd[1]: Failed to start Docker Application Container Engine.
systemd[1]: docker.service: Unit entered failed state.
systemd[1]: docker.service: Failed with result 'exit-code'.
dockerd -v
: Docker version 19.03.8, build afacb8b7f0
I think I have to nuke /var/lib/docker
. I hope I have backed up everything!!
metoo
I'm seeing similar issue on Docker version 18.09.8, build 0dd43dd
sudo journalctl -u docker -n 50 -l
– Logs begin at Mon 2021-12-06 09:09:42 PST, end at Mon 2021-12-06 13:30:45 PST. –
Dec 06 13:30:30 x.x.x.x dockerd[16206]: time="2021-12-06T13:30:30.755321037-08:00" level=info msg="Removing stale sandbox cd3f42edd7a8a86b35cfd69807799f517061a49128bd41ab1489884f9b96
Dec 06 13:30:31 x.x.x.x dockerd[16206]: time="2021-12-06T13:30:31.239411071-08:00" level=info msg="Removing stale sandbox e2019dfbf65a056ba74d8a767f9267da4689e89b867a1e6f0ba4998abb2d
Dec 06 13:30:31 x.x.x.x dockerd[16206]: time="2021-12-06T13:30:31.520052063-08:00" level=info msg="Removing stale sandbox 06171388d0cc342ac654b2393ef37c84a52e0de07353fb52e95cd04d8dab
Dec 06 13:30:31 x.x.x.x dockerd[16206]: time="2021-12-06T13:30:31.797549538-08:00" level=info msg="Removing stale sandbox fd7b89dffecaa1d7973f7f4aacf328905d24df0db4f04c79bad0e4026636
Dec 06 13:30:32 x.x.x.x dockerd[16206]: time="2021-12-06T13:30:32.097031821-08:00" level=info msg="Removing stale sandbox acc8dc4a7902861e431c071cdb1332e2725841d220548afbd07fa6db7d1a
Dec 06 13:30:32 x.x.x.x dockerd[16206]: time="2021-12-06T13:30:32.465205137-08:00" level=info msg="Removing stale sandbox 962786fa92a5767375da6ec0482a9386de3106a3e0293780381b22ebb68e
Dec 06 13:30:32 x.x.x.x dockerd[16206]: time="2021-12-06T13:30:32.754399081-08:00" level=info msg="Removing stale sandbox 084aaea4244459e1d5afedf72a5bee54d17ae0e49cc072f4be54bfd991e1
Dec 06 13:30:33 x.x.x.x dockerd[16206]: time="2021-12-06T13:30:33.045521628-08:00" level=info msg="Removing stale sandbox 75495bab1a2214e36547678b5700a59a7cb648f96ab79a98c829754ff5ba
Dec 06 13:30:33 x.x.x.x dockerd[16206]: time="2021-12-06T13:30:33.340806234-08:00" level=info msg="Removing stale sandbox 2a6ae04eb62d3880b36383dec4ad189882658fe2a550afdae07ff121c600
Dec 06 13:30:33 x.x.x.x dockerd[16206]: time="2021-12-06T13:30:33.621067954-08:00" level=info msg="Removing stale sandbox 3452b16954e7059d0d9e68a1fa1e8c888106e696736aacbcc76be3b6246b
Dec 06 13:30:33 x.x.x.x dockerd[16206]: time="2021-12-06T13:30:33.910043202-08:00" level=info msg="Removing stale sandbox 6222e20e47ed8c3ba866aea0854289db0d05f6ad1b87e16a59ad2990a07e
Dec 06 13:30:34 x.x.x.x dockerd[16206]: time="2021-12-06T13:30:34.210516507-08:00" level=info msg="Removing stale sandbox 794e4991c6b88cce91de129f8de569a66a0a26d5f11ff521c07642502804
Dec 06 13:30:34 x.x.x.x dockerd[16206]: time="2021-12-06T13:30:34.488873864-08:00" level=info msg="Removing stale sandbox bcd7161c13f4975740fae5de70bfe51e47292085a1e88f67927950b2673b
Dec 06 13:30:34 x.x.x.x dockerd[16206]: time="2021-12-06T13:30:34.764468338-08:00" level=info msg="Removing stale sandbox d8040ca368db0e0f99be486e676c
bash:~$ sudo journalctl -u docker -l | wc -l 43912
+ We had to kill docker process and wait for it remove those stale network interfaces.
Hi all, I face an issue which looks quite similar. Do you have any idea on a workaround?
For Reference:
[root@x.x ~]# journalctl -u docker -n 50 -l -- Logs begin at Tue 2023-01-17 13:33:24 CET, end at Mon 2023-02-13 14:17:39 CET. -- Feb 13 14:17:33 x.x dockerd[9596]: time="2023-02-13T14:17:33.527567369+01:00" level=info msg="Removing stale sandbox f48f6e18ac8f01e9c305c796a9252894ff9b8f0af950b3a0a70e0207ceef5c70 (fc6fcd37cf54d087c37627b5bac4a713884bfbc1481322da27ff28a5c34c19ca)" Feb 13 14:17:33 x.x dockerd[9596]: time="2023-02-13T14:17:33.668269586+01:00" level=info msg="Removing stale sandbox fdef052aa4d0f3f997dd43b67a1f30d28942b7b450d1ad17736916dfdbdcd35b (119932af4811785dde69e818a40e901dde24ff358967fde2cd2decdb7294e3f8)" Feb 13 14:17:33 x.x dockerd[9596]: time="2023-02-13T14:17:33.803477271+01:00" level=info msg="Removing stale sandbox ff55f7c7292690c6fe415125c50fb320e93c17210feb0d869500e339e155507a (8dedc5092b3a8b2d579ba9438466b7032ef54a07662ff715f35ff42818093c6b)" Feb 13 14:17:33 x.x dockerd[9596]: time="2023-02-13T14:17:33.930769176+01:00" level=info msg="Removing stale sandbox 103e19ff12b0f415b61604ebe110b00cf325d13ceba1518c5f2dbcddc75cdeba (c21ae3c59444ae3f6f85a50fd24ab327cdae3113e56d0c589c34737a0f3b67ee)" Feb 13 14:17:36 x.x dockerd[9596]: time="2023-02-13T14:17:36.494434812+01:00" level=info msg="Removing stale sandbox d7a073f06a0acd22637af71e6a8ccda228074cced1c0a0481b999a093d2451ea (251cf7c0ab98f5896e4c980f7f2c91db341eacf7c164c5914f48dfaaa8156e43)" Feb 13 14:17:36 x.x dockerd[9596]: time="2023-02-13T14:17:36.623828201+01:00" level=info msg="Removing stale sandbox 1f215bd9999ca4067c1b0346caae68d5a24a47c8efb9a99e9f513ad35f15d999 (6ab73d128c4e17de862e8d328a6cd40ed1595378b5a2667e094140767b1c644e)" Feb 13 14:17:36 x.x dockerd[9596]: time="2023-02-13T14:17:36.764782584+01:00" level=info msg="Removing stale sandbox 24acb278e43dbdc49b92ca723478c41bc8287ddb5a9fdfad9cb9872681e60d47 (f78e6032cd8b4f25c72b036e70c56351432258a3eb71f8aa0b7ddbe957f405cf)" Feb 13 14:17:36 x.x dockerd[9596]: time="2023-02-13T14:17:36.776151783+01:00" level=error msg="failed to create osl sandbox while trying to restore sandbox de579cb: failed get network namespace \"/var/run/docker/netns/de579cbef9f8\": no such file or directory" Feb 13 14:17:36 x.x dockerd[9596]: time="2023-02-13T14:17:36.900964893+01:00" level=info msg="Removing stale sandbox 78510cc0d316eb4fd1fb1c135c38304b25d3f4545705e16c5dc8ffd9ef9bc9a4 (8c420c5891e5eb47603eeeaf7aba706379bb4375bf5b52bc6c38730d9c93ba2b)"
[root@x.x ~]# journalctl -u docker -l | wc -l 705571
my "workaround" around the stale sandbox is to remove the ../docker/network/files/local-kv.db periodically via cronjob. Not nice, not a solution but it keeps the sandbox removal time low.
Expected behavior
Being able to start docker
Actual behavior
After the last crash, docker is not starting with following log trace:
Steps to reproduce the behavior
Not sure how to reproduce it. But currently docker is down, and there are multiple docker's network interfaces present. If I remove them manually (at least br-xxx ones), then they are recreated when I try to run docker again.
Output of
docker version
:Output of
docker info
: