The parallel version runs on dev02 but not on pearl. There are some issues with importing libraries. The conda environment in both cases is identical:
On dev02 the following import produces no error:
from torch_geometric.data import DataListLoader, DataLoader
On pearl there is an error:
from torch_geometric.data import DataLoader
Traceback (most recent call last):
File "", line 1, in
File "/mnt/beegfs/home/pearl061/.conda/envs/hydronet_test/lib/python3.8/site-packages/torch_geometric/init.py", line 4, in
import torch_geometric.data
File "/mnt/beegfs/home/pearl061/.conda/envs/hydronet_test/lib/python3.8/site-packages/torch_geometric/data/init.py", line 1, in
from .data import Data
File "/mnt/beegfs/home/pearl061/.conda/envs/hydronet_test/lib/python3.8/site-packages/torch_geometric/data/data.py", line 20, in
from torch_sparse import SparseTensor
File "/mnt/beegfs/home/pearl061/.conda/envs/hydronet_test/lib/python3.8/site-packages/torch_sparse/init.py", line 19, in
torch.ops.load_library(spec.origin)
File "/mnt/beegfs/home/pearl061/.conda/envs/hydronet_test/lib/python3.8/site-packages/torch/_ops.py", line 643, in load_library
ctypes.CDLL(path)
File "/mnt/beegfs/home/pearl061/.conda/envs/hydronet_test/lib/python3.8/ctypes/init.py", line 373, in init
self._handle = _dlopen(self._name, mode)
OSError: libtorch_cuda_cu.so: cannot open shared object file: No such file or directory
The parallel version runs on dev02 but not on pearl. There are some issues with importing libraries. The conda environment in both cases is identical:
On dev02 the following import produces no error: from torch_geometric.data import DataListLoader, DataLoader
On pearl there is an error: