-
## Background information
### What version of the PMIx Reference Library are you using?
4.2.8
### Describe how PMIx was installed
tarball
### Please describe the system on which you a…
-
_From @edsko on September 24, 2012 12:54_
which closes the entire "bundle" of (outgoing _and_ incoming) connections to another endpoint. Basically, "disconnect completely from this other endpoint" (a…
mboes updated
1 month ago
-
## Background information
Writing to single shared file from many process triggers systematically an error linked to the underlying UCS
module from the UCX library.
### OpenMPI version 4.1.5
…
-
Dear GPI maintainers,
I'm playing around with fault tolerance and what GASPI's timeout feature can do to make a program survive a rank failure.
In a test program I can identify which rank died but I…
-
We have a more complete picture of the hardware layout c/o John-Paul. I have it in a powerpoint file on Slack in #datascience.
At the least we need to add
- uplinks to the internet for both on a…
-
current bonding code will run the apply on alphabetically sorted interface names.
for bond interfaces, the apply on the bond interface should run after the apply on the slave interfaces. (or try t…
-
Hi there, my MPI program hits a seg fault when running on a single Infiniband-enabled node. I'm trying to understand whether it's related to this issue: https://github.com/open-mpi/ompi/issues/6666.
…
-
Hello,
After add rules (i use this file : https://github.com/treydock/infiniband_exporter/blob/main/examples/infiniband.rules), Prometheus log show many warning message.
Is it normal ?
`…
-
I have been using `dask`+`distributed` for a while: I have a python script running every morning that launches a `LocalCluster(n_workers=4)`, load some data, process it a bit, persist and publish as a…
-
Not sure this is indeed a feature request / issue /..., but probably more like a request for comments how one could handle my current usecase:
Instead of specifying network options on the cmdline I…