-
Hello, after I downloaded orthofinder with conda, the following code appeared
For Linux 64, Open MPI is built with CUDA awareness but this support is disabled by default.
To enable it, please set th…
-
CC: @e10harvey
I have been noticing random test failures on the 'ats2' 'vortex' builds where the test shows no output and the return value is 255. For example, must today in the build:
* ` Tri…
-
**Environment:**
1. Framework: TensorFlow,
2. Framework version: 2.16
3. Horovod version: 0.28.1
4. MPI version:
5. CUDA version: 12.2
6. NCCL version:
7. Python version: 3.11.8
8. Spark / …
-
I was made aware of a Cuda IPC related issue when running a simple AthenaPK test case in parallel.
Given that the test case mostly exercises Parthenon base features, I'm opening an issue here to rais…
-
1. It would be useful to add a short test to the CommunicationBase setup. This way with each run we can be sure e.g. cuda-aware mpi and cuda p2p are working as intended.
2. Apparently the current cud…
-
Hello,
My apologies to trouble you with this. I've been trying to compile and run the ALPS example Knap with MPI.
Here are my steps:
1. `wget https://raw.githubusercontent.com/coin-or/coinbre…
-
## Bug Report
@trilinos/muelu
### Description
The `MueLu_UnitTestsTpetra_MPI_1` and `MueLu_UnitTestsTpetra_kokkos_MPI_{1,4}` tests are failing a couple checks in cuda/11.2 builds with UVM enable…
-
Hi,Does ROCM pytorch support distributed training with MPI backend?
Now pytorch can't work with MPI. The error information is as follows:
RuntimeError: CUDA tensor detected and the MPI used doesn't…
-
@trilinos/framework, @vbrunini, @jclause, @jhux2
## Internal Issues:
* [TRILINOSHD-114](sems-atlassian-son.sandia.gov/jira/servicedesk/customer/portal/7/TRILINOSHD-114)
## Description
The…
-
(Apparently @JunoRavin discovered this problem last summer, but it appears to be much more prevalent than previously believed.)
Running the CUDA build of the moment app with MPI enabled can easily …