Closed Crandel closed 7 years ago
Please paste the output of docker info
run from your laptop, redacting out any potentially sensitive information.
docker info
Containers: 3
Running: 2
Paused: 0
Stopped: 1
Images: 7
Server Version: 17.03.1-ce
Storage Driver: devicemapper
Pool Name: docker-8:17-20972999-pool
Pool Blocksize: 65.54 kB
Base Device Size: 10.74 GB
Backing Filesystem: xfs
Data file: /dev/loop0
Metadata file: /dev/loop1
Data Space Used: 11.16 GB
Data Space Total: 107.4 GB
Data Space Available: 96.22 GB
Metadata Space Used: 8.757 MB
Metadata Space Total: 2.147 GB
Metadata Space Available: 2.139 GB
Thin Pool Minimum Free Space: 10.74 GB
Udev Sync Supported: true
Deferred Removal Enabled: false
Deferred Deletion Enabled: false
Deferred Deleted Device Count: 0
Data loop file: /media/data/linux/docker/devicemapper/devicemapper/data
WARNING: Usage of loopback devices is strongly discouraged for production use. Use `--storage-opt dm.thinpooldev` to specify a custom block storage device.
Metadata loop file: /media/data/linux/docker/devicemapper/devicemapper/metadata
Library Version: 1.02.138 (2017-03-28)
Logging Driver: json-file
Cgroup Driver: cgroupfs
Plugins:
Volume: local
Network: bridge host macvlan null overlay
Swarm: inactive
Runtimes: runc
Default Runtime: runc
Init Binary: docker-init
containerd version: 4ab9917febca54791c5f071a9d1f404867857fcc
runc version: 54296cf40ad8143b62dbcaa1d90e520a2136ddfe
init version: 949e6fa
Security Options:
seccomp
Profile: default
Kernel Version: 4.10.8-1-ARCH
Operating System: Arch Linux
OSType: linux
Architecture: x86_64
CPUs: 8
Total Memory: 19.44 GiB
Name: cradlemanl
ID: JCHQ:CR5J:IR2P:BIIX:FTE4:J7P3:5YCK:3LXW:I7LP:EEBK:EKOL:IK7K
Docker Root Dir: /media/data/linux/docker
Debug Mode (client): false
Debug Mode (server): false
Registry: https://index.docker.io/v1/
Experimental: false
Insecure Registries:
127.0.0.0/8
Live Restore Enabled: false
Yeah, I suspected devicemapper
would be playing a role in this. I've never had luck running clusterdock
with devicemapper
as the storage backend driver. Try upgrading to aufs
or overlayfs
and you won't have any issues.
So I changed storage driver to overlayfs2 (Arch wiki recommends), deleted all images and pull it again. I have this error
!!! Parallel execution exception under host u'192.168.123.4':
Process 192.168.123.4:
Traceback (most recent call last):
File "/usr/lib/python2.7/multiprocessing/process.py", line 258, in _bootstrap
self.run()
File "/usr/lib/python2.7/multiprocessing/process.py", line 114, in run
self._target(*self._args, **self._kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 242, in inner
submit(task.run(*args, **kwargs))
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 174, in run
return self.wrapped(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/decorators.py", line 181, in inner
return func(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 171, in __call__
return self.run(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 174, in run
return self.wrapped(*args, **kwargs)
File "/root/clusterdock/clusterdock/ssh.py", line 38, in _quiet_task
return run(command)
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 677, in host_prompting_wrapper
return func(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/operations.py", line 1088, in run
shell_escape=shell_escape, capture_buffer_size=capture_buffer_size,
File "/usr/local/lib/python2.7/dist-packages/fabric/operations.py", line 928, in _run_command
channel=default_channel(), command=wrapped_command, pty=pty,
File "/usr/local/lib/python2.7/dist-packages/fabric/state.py", line 418, in default_channel
chan = _open_session()
File "/usr/local/lib/python2.7/dist-packages/fabric/state.py", line 410, in _open_session
return connections[env.host_string].get_transport().open_session()
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 159, in __getitem__
self.connect(key)
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 151, in connect
user, host, port, cache=self, seek_gateway=seek_gateway)
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 603, in connect
raise NetworkError(msg, e)
NetworkError: Timed out trying to connect to 192.168.123.4 (tried 60 times)
Fatal error: One or more hosts failed while executing task '_quiet_task'
Underlying exception:
Timed out trying to connect to 192.168.123.4 (tried 60 times)
Aborting.
Exception in thread Thread-1:
Traceback (most recent call last):
File "/usr/lib/python2.7/threading.py", line 552, in __bootstrap_inner
self.run()
File "/usr/lib/python2.7/threading.py", line 505, in run
self.__target(*self.__args, **self.__kwargs)
File "/root/clusterdock/clusterdock/cluster.py", line 265, in start
raise Exception("Timed out waiting for {0} to become reachable.".format(self.hostname))
Exception: Timed out waiting for node-1 to become reachable.
!!! Parallel execution exception under host u'192.168.123.5':
Process 192.168.123.5:
Traceback (most recent call last):
File "/usr/lib/python2.7/multiprocessing/process.py", line 258, in _bootstrap
self.run()
File "/usr/lib/python2.7/multiprocessing/process.py", line 114, in run
self._target(*self._args, **self._kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 242, in inner
submit(task.run(*args, **kwargs))
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 174, in run
return self.wrapped(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/decorators.py", line 181, in inner
return func(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 171, in __call__
return self.run(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 174, in run
return self.wrapped(*args, **kwargs)
File "/root/clusterdock/clusterdock/ssh.py", line 38, in _quiet_task
return run(command)
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 677, in host_prompting_wrapper
return func(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/operations.py", line 1088, in run
shell_escape=shell_escape, capture_buffer_size=capture_buffer_size,
File "/usr/local/lib/python2.7/dist-packages/fabric/operations.py", line 928, in _run_command
channel=default_channel(), command=wrapped_command, pty=pty,
File "/usr/local/lib/python2.7/dist-packages/fabric/state.py", line 418, in default_channel
chan = _open_session()
File "/usr/local/lib/python2.7/dist-packages/fabric/state.py", line 410, in _open_session
return connections[env.host_string].get_transport().open_session()
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 159, in __getitem__
self.connect(key)
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 151, in connect
user, host, port, cache=self, seek_gateway=seek_gateway)
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 603, in connect
raise NetworkError(msg, e)
NetworkError: Timed out trying to connect to 192.168.123.5 (tried 60 times)
Fatal error: One or more hosts failed while executing task '_quiet_task'
Underlying exception:
Timed out trying to connect to 192.168.123.5 (tried 60 times)
Aborting.
Exception in thread Thread-2:
Traceback (most recent call last):
File "/usr/lib/python2.7/threading.py", line 552, in __bootstrap_inner
self.run()
File "/usr/lib/python2.7/threading.py", line 505, in run
self.__target(*self.__args, **self.__kwargs)
File "/root/clusterdock/clusterdock/cluster.py", line 265, in start
raise Exception("Timed out waiting for {0} to become reachable.".format(self.hostname))
Exception: Timed out waiting for node-2 to become reachable.
INFO:clusterdock.cluster:Started cluster in 62.61 seconds.
I have this new lines in /etc/hosts
192.168.123.2 node-1.hadoop # Added by clusterdock
192.168.123.3 node-2.hadoop # Added by clusterdock
192.168.123.4 node-1.hadoop # Added by clusterdock
192.168.123.5 node-2.hadoop # Added by clusterdock
Run clusterdock_run ./bin/housekeeping nuke
to clean up the existing container clusters on your machine (as well as the /etc/hosts
file) and then try again.
I run this
clusterdock_run ./bin/housekeeping nuke
INFO:housekeeping:Removing all containers on this host...
INFO:housekeeping:Successfully removed all containers on this host.
INFO:housekeeping:Removing all user-defined networks on this host...
INFO:housekeeping:Successfully removed all user-defined networks on this host.
INFO:housekeeping:Clearing container entries from /etc/hosts...
INFO:housekeeping:Successfully cleared container entries from /etc/hosts.
INFO:housekeeping:Restarting Docker daemon...
INFO:housekeeping:Successfully nuked this host.
and have the same error
clusterdock_run ./bin/start_cluster -n hadoop cdh --include-service-type=HDFS,YARN,HIVE,HUE,OOZIE,SPARK --primary-node=node-1 --secondary-nodes=node-2
INFO:clusterdock.cluster:Network (hadoop) not present, creating it...
INFO:clusterdock.cluster:Successfully setup network (name: hadoop).
!!! Parallel execution exception under host u'192.168.123.2':
Process 192.168.123.2:
Traceback (most recent call last):
File "/usr/lib/python2.7/multiprocessing/process.py", line 258, in _bootstrap
self.run()
File "/usr/lib/python2.7/multiprocessing/process.py", line 114, in run
self._target(*self._args, **self._kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 242, in inner
submit(task.run(*args, **kwargs))
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 174, in run
return self.wrapped(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/decorators.py", line 181, in inner
return func(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 171, in __call__
return self.run(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 174, in run
return self.wrapped(*args, **kwargs)
File "/root/clusterdock/clusterdock/ssh.py", line 38, in _quiet_task
return run(command)
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 677, in host_prompting_wrapper
return func(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/operations.py", line 1088, in run
shell_escape=shell_escape, capture_buffer_size=capture_buffer_size,
File "/usr/local/lib/python2.7/dist-packages/fabric/operations.py", line 928, in _run_command
channel=default_channel(), command=wrapped_command, pty=pty,
File "/usr/local/lib/python2.7/dist-packages/fabric/state.py", line 418, in default_channel
chan = _open_session()
File "/usr/local/lib/python2.7/dist-packages/fabric/state.py", line 410, in _open_session
return connections[env.host_string].get_transport().open_session()
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 159, in __getitem__
self.connect(key)
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 151, in connect
user, host, port, cache=self, seek_gateway=seek_gateway)
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 603, in connect
raise NetworkError(msg, e)
NetworkError: Timed out trying to connect to 192.168.123.2 (tried 60 times)
Fatal error: One or more hosts failed while executing task '_quiet_task'
Underlying exception:
Timed out trying to connect to 192.168.123.2 (tried 60 times)
Aborting.
Exception in thread Thread-1:
Traceback (most recent call last):
File "/usr/lib/python2.7/threading.py", line 552, in __bootstrap_inner
self.run()
File "/usr/lib/python2.7/threading.py", line 505, in run
self.__target(*self.__args, **self.__kwargs)
File "/root/clusterdock/clusterdock/cluster.py", line 265, in start
raise Exception("Timed out waiting for {0} to become reachable.".format(self.hostname))
Exception: Timed out waiting for node-1 to become reachable.
!!! Parallel execution exception under host u'192.168.123.3':
Process 192.168.123.3:
Traceback (most recent call last):
File "/usr/lib/python2.7/multiprocessing/process.py", line 258, in _bootstrap
self.run()
File "/usr/lib/python2.7/multiprocessing/process.py", line 114, in run
self._target(*self._args, **self._kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 242, in inner
submit(task.run(*args, **kwargs))
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 174, in run
return self.wrapped(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/decorators.py", line 181, in inner
return func(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 171, in __call__
return self.run(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 174, in run
return self.wrapped(*args, **kwargs)
File "/root/clusterdock/clusterdock/ssh.py", line 38, in _quiet_task
return run(command)
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 677, in host_prompting_wrapper
return func(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/operations.py", line 1088, in run
shell_escape=shell_escape, capture_buffer_size=capture_buffer_size,
File "/usr/local/lib/python2.7/dist-packages/fabric/operations.py", line 928, in _run_command
channel=default_channel(), command=wrapped_command, pty=pty,
File "/usr/local/lib/python2.7/dist-packages/fabric/state.py", line 418, in default_channel
chan = _open_session()
File "/usr/local/lib/python2.7/dist-packages/fabric/state.py", line 410, in _open_session
return connections[env.host_string].get_transport().open_session()
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 159, in __getitem__
self.connect(key)
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 151, in connect
user, host, port, cache=self, seek_gateway=seek_gateway)
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 603, in connect
raise NetworkError(msg, e)
NetworkError: Timed out trying to connect to 192.168.123.3 (tried 60 times)
Fatal error: One or more hosts failed while executing task '_quiet_task'
Underlying exception:
Timed out trying to connect to 192.168.123.3 (tried 60 times)
Aborting.
Exception in thread Thread-2:
Traceback (most recent call last):
File "/usr/lib/python2.7/threading.py", line 552, in __bootstrap_inner
self.run()
File "/usr/lib/python2.7/threading.py", line 505, in run
self.__target(*self.__args, **self.__kwargs)
File "/root/clusterdock/clusterdock/cluster.py", line 265, in start
raise Exception("Timed out waiting for {0} to become reachable.".format(self.hostname))
Exception: Timed out waiting for node-2 to become reachable.
INFO:clusterdock.cluster:Started cluster in 77.61 seconds.
!!! Parallel execution exception under host u'192.168.123.2':
Process 192.168.123.2:
Traceback (most recent call last):
File "/usr/lib/python2.7/multiprocessing/process.py", line 258, in _bootstrap
self.run()
File "/usr/lib/python2.7/multiprocessing/process.py", line 114, in run
self._target(*self._args, **self._kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 242, in inner
submit(task.run(*args, **kwargs))
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 174, in run
return self.wrapped(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/decorators.py", line 181, in inner
return func(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 171, in __call__
return self.run(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 174, in run
return self.wrapped(*args, **kwargs)
File "/root/clusterdock/clusterdock/ssh.py", line 45, in _task
return run(command)
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 677, in host_prompting_wrapper
return func(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/operations.py", line 1088, in run
shell_escape=shell_escape, capture_buffer_size=capture_buffer_size,
File "/usr/local/lib/python2.7/dist-packages/fabric/operations.py", line 928, in _run_command
channel=default_channel(), command=wrapped_command, pty=pty,
File "/usr/local/lib/python2.7/dist-packages/fabric/state.py", line 418, in default_channel
chan = _open_session()
File "/usr/local/lib/python2.7/dist-packages/fabric/state.py", line 410, in _open_session
return connections[env.host_string].get_transport().open_session()
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 159, in __getitem__
self.connect(key)
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 151, in connect
user, host, port, cache=self, seek_gateway=seek_gateway)
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 603, in connect
raise NetworkError(msg, e)
NetworkError: Timed out trying to connect to 192.168.123.2 (tried 60 times)
!!! Parallel execution exception under host u'192.168.123.3':
Process 192.168.123.3:
Traceback (most recent call last):
File "/usr/lib/python2.7/multiprocessing/process.py", line 258, in _bootstrap
self.run()
File "/usr/lib/python2.7/multiprocessing/process.py", line 114, in run
self._target(*self._args, **self._kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 242, in inner
submit(task.run(*args, **kwargs))
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 174, in run
return self.wrapped(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/decorators.py", line 181, in inner
return func(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 171, in __call__
return self.run(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/tasks.py", line 174, in run
return self.wrapped(*args, **kwargs)
File "/root/clusterdock/clusterdock/ssh.py", line 45, in _task
return run(command)
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 677, in host_prompting_wrapper
return func(*args, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/fabric/operations.py", line 1088, in run
shell_escape=shell_escape, capture_buffer_size=capture_buffer_size,
File "/usr/local/lib/python2.7/dist-packages/fabric/operations.py", line 928, in _run_command
channel=default_channel(), command=wrapped_command, pty=pty,
File "/usr/local/lib/python2.7/dist-packages/fabric/state.py", line 418, in default_channel
chan = _open_session()
File "/usr/local/lib/python2.7/dist-packages/fabric/state.py", line 410, in _open_session
return connections[env.host_string].get_transport().open_session()
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 159, in __getitem__
self.connect(key)
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 151, in connect
user, host, port, cache=self, seek_gateway=seek_gateway)
File "/usr/local/lib/python2.7/dist-packages/fabric/network.py", line 603, in connect
raise NetworkError(msg, e)
NetworkError: Timed out trying to connect to 192.168.123.3 (tried 60 times)
Fatal error: One or more hosts failed while executing task '_task'
Underlying exception:
Timed out trying to connect to 192.168.123.2 (tried 60 times)
Aborting.
/etc/hosts
192.168.123.2 node-1.hadoop # Added by clusterdock
192.168.123.3 node-2.hadoop # Added by clusterdock
Have you tried restarting your machine? Docker networking might be misbehaving if you have trouble resolving nodes like that.
Thank you very much!!! After reload everything works fine!!!
I have Intel Core i7 and 20 Gb Ram on my laptop. When I run
I always have this error.
I could connect to node-1 and node-2 and see all directories inside HDFS, but when I try to copy files from local to hdfs I always have this error
This command
sudo -u hdfs hdfs dfsadmin -report
give me this result