-
### Describe the bug
We have a couple of ARM (Thunder) nodes to play with before we get our A64fx cluster. The nodes have OFED5 on them and mlx5 devices. I get free passive progress from OFED4 + …
-
iSER discovery does not seem to be fully implemented, and I can't find an example for it. I therefore simply modified iscsi-ls, changing iscsi:// to iser://
I then got a segmentation fault after the iscsi context was freed.
-
Background: ib_mad is the resulting ko, which ld links together with cm.o. This is an out-of-tree (OOT) module driver, which is not part of the kernel.
Please point out anything wrong in my description or in my un…
-
I haven't examined the code yet. As I understand it, it's single-threaded, right?
If so, provided that by default masscan is already quite fast and can end up using up all the bandwidth of a network, …
-
Thank you for taking the time to submit an issue!
## Background information
### What version of Open MPI are you using? (e.g., v3.0.5, v4.0.2, git branch name and hash, etc.)
v3.1.5
### De…
-
Since #30, EXtra-data can open multiple files in parallel when initially opening a run. But if we then need to pull data from many files - sequence files or detector modules - we still handle these on…
-
Dear GPI maintainers,
I'm playing around with fault tolerance and what GASPI's timeout feature can do to make a program survive a rank failure.
In a test program I can identify which rank died but I…
-
Hello!
It seems that something has changed in Linux 4.9 in the way it represents bonded Mellanox interfaces, which breaks VMA's offloading functionality for teamed interfaces.
[r…
-
I am using the OSU benchmarks to understand the performance of Open MPI + UCX with CUDA + InfiniBand transfers. I see some numbers that I do not understand, and would appreciate some guidance. I am running on …
-
See https://github.com/dmlc/xgboost/issues/6232 for setup details.
The last thing in the logs before the hang is:
```
[1603364135.700317] [mr-dl10:19153:0] sock.c:344 UCX ERROR recv(fd=145) fai…