Closed csunclechen closed 4 years ago
I was trying to test the benchmark. I have two host servers and two alveo U280 cards. each host connect one card through pcie. And two cards connect the switch through qsfp28. Have you met this problem before?
Hi @csunclechen
Can you run this piece of code in the jupyter notebook?
for i in range(len(pynq.Device.devices)):
print("{}) {}".format(i, pynq.Device.devices[i].name))
If you get the a list of Alveo platforms everything is OK with pynq and XRT environment.
The next thing to consider is the the DaskDevice class, in particular the __init__
method. I am always giving name to the workers when creating the cluster. However, if no name is given to a worker, the default name is tcp://..
. This default name produces an error
Can you update the __init__
method of the DaskDevice
class, import import re
as well
def __init__(self, client, worker):
"""The worker ID should be unique
"""
worker_id= re.sub(r'[^\w]', '_', worker)
super().__init__("dask-" + worker_id)
self._dask_client = client
self._worker = worker
self.capabilities = {
'REGISTER_RW': True,
'CALLABLE': True
}
self._streams = {}
@csunclechen,
I was able to reproduce the issue and pushed a bug fix. Can you try the latest notebooks?
Exception in thread Thread-4: Traceback (most recent call last): File "/usr/lib/python3.6/threading.py", line 916, in _bootstrap_inner self.run() File "/usr/lib/python3.6/threading.py", line 864, in run self._target(*self._args, **self._kwargs) File "/home/cjj/.local/lib/python3.6/site-packages/pynq/pl_server/server.py", line 542, in server_proc server = Listener(self.socket_name, family='AF_UNIX', authkey=self.key) File "/usr/lib/python3.6/multiprocessing/connection.py", line 438, in init self._listener = SocketListener(address, family, backlog) File "/usr/lib/python3.6/multiprocessing/connection.py", line 576, in init self._socket.bind(address) FileNotFoundError: [Errno 2] No such file or directory
Exception in thread Thread-5: Traceback (most recent call last): File "/usr/lib/python3.6/threading.py", line 916, in _bootstrap_inner self.run() File "/usr/lib/python3.6/threading.py", line 864, in run self._target(*self._args, **self._kwargs) File "/home/cjj/.local/lib/python3.6/site-packages/pynq/pl_server/server.py", line 542, in server_proc server = Listener(self.socket_name, family='AF_UNIX', authkey=self.key) File "/usr/lib/python3.6/multiprocessing/connection.py", line 438, in init self._listener = SocketListener(address, family, backlog) File "/usr/lib/python3.6/multiprocessing/connection.py", line 576, in init self._socket.bind(address) FileNotFoundError: [Errno 2] No such file or directory
ConnectionError Traceback (most recent call last)