Open zdexter opened 8 years ago
I am having the same issues except using docker 1.12 and machine 0.8.0 build b85aac1
After some testing it seems like it has something to do with the NAT network within virtualbox failing. It seems to be triggered at least in my case by "large" downloads and no amount of restarting my machine or the vm seems to be able to get around the issue.
in my case, restarting the vm's host fixes the issue. no observed correlation with downloads.
Well restarting the vm works until I do something more complex then ping inside a container. Then 85-90% of the time anything that has to go over the NAT network dies.
It seems like it is a issue with the virtualbox NAT network but I don't really know a good way to test.
I downgraded to use the 1.11.2 boot2docker image and the issue is fixed. It seems like there is a network driver issue in the newer kernels. @zdexter can you confirm this on your end?
Environment
Network Information
I have various other network interfaces, but am not aware that any others are relevant. None overlap with the
192.168.99.0/24
subnet.Reproduction steps
docker-machine start my-machine
... then I can observe the machine in a
Timeout
state duringdocker-machine ls
. activedocker exec
sessions print something like:Other possible causes
Sleeping the host seems to make this happen more often, and more quickly than 20-30min. Sometimes I use the VM after returning from a sleep before it enters a
Timeout
state, and sometimes it's already in aTimeout
state.Past reports
This PR has a unit test that asserts that
docker-machine
should raise an error in a variety of subnet-overlap cases, but I don't have any other subnets besidesvboxnet0
that use192.168.99.0/24
. https://github.com/docker/machine/pull/3165Known temporary fixes
Restarting the osx host results in a fix that lasts 20min-1hr.