-
I'm running an application on a cluster that uses CUDA-aware MPICH (v4.2.2) and UCX (v1.17.0). My application consists of two binaries, a server and a client, so I use the MPMD mode of `mpirun` to exe…
cl3to updated
2 weeks ago
-
Maybe I don't know how to read, but it certainly seems like I put `xtb` in `NWCHEM_MODULES`, yet it's not recognized:
```
jehammond@oppenheimer:~/NWChem/github/src$ echo $NWCHEM_MODULES
smallqm moi…
```
-
### Software versions
Python : 3.12.4 | packaged by conda-forge | (main, Jun 17 2024, 10:23:07) [GCC 12.3.0]
Platform : Linux-5.4.0-169-generic-x86_64-with-glibc2.31
Legion : legi…
-
**Description**
I have been successfully running `gesv` using GPU-aware MPI on an AMD machine with HIP (Setonix at the Pawsey Supercomputing Centre, Australia). But I am getting segfaults trying to do the s…
-
### System Info
- CPU x86_64
### Who can help?
_No response_
### Information
- [ ] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An officially supported task in the…
jlewi updated
4 weeks ago
-
## Background information
### What version of Open MPI are you using? (e.g., v3.0.5, v4.0.2, git branch name and hash, etc.)
4.0.5, as shipped with the NVIDIA HPC SDK 21.2
### Please describe the sys…
-
Hi,
We have a supercomputer with an Omni-Path network, which requires `cudaSetDevice` to be called before `MPI_Init` for GPUDirect to work.
Some of our users are using Kokkos on our facility an…
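
One common way to satisfy this ordering is to choose the device from the node-local rank that the launcher exports in the environment, since no MPI call is available yet. The sketch below is only an illustration under that assumption: the helper names `local_rank_from_env` and `device_for_rank` are hypothetical, and which variable is actually set depends on the launcher in use on a given system.

```python
import os

# Environment variables that some launchers export with the node-local rank.
# Assumption: this list is illustrative and not taken from the original report.
LOCAL_RANK_VARS = (
    "OMPI_COMM_WORLD_LOCAL_RANK",  # Open MPI
    "MV2_COMM_WORLD_LOCAL_RANK",   # MVAPICH2
    "SLURM_LOCALID",               # Slurm (srun)
)


def local_rank_from_env(env=None):
    """Return the node-local rank found in the environment, or None."""
    env = os.environ if env is None else env
    for name in LOCAL_RANK_VARS:
        value = env.get(name)
        if value is not None:
            return int(value)
    return None


def device_for_rank(local_rank, num_devices):
    """Map a node-local rank to a device index, round-robin."""
    if num_devices <= 0:
        raise ValueError("num_devices must be positive")
    return local_rank % num_devices


if __name__ == "__main__":
    rank = local_rank_from_env()
    if rank is not None:
        # The resulting index is what would be passed to cudaSetDevice
        # before MPI_Init is called; 4 devices per node is a placeholder.
        print(device_for_rank(rank, num_devices=4))
```

The round-robin mapping assumes ranks on a node are numbered contiguously from zero; a site with a different binding policy would need a different mapping.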
-
**DONE:**
- [ ] profile of `tests/sp2n/Test_hmc_Sp_WilsonFundFermionGauge.cc` done and stored in shared folder
- [x] `nsys` GPU
- [ ] `ncu` GPU
- [ ] CPU (vtune, muProf)
-
### Describe the bug
On multi-GPU systems, using HIP or CUDA, a process is spawned on all GPUs instead of on only one of them. (See the To reproduce section.)
This results in memory leak…
-
Hello,
I am running some latency benchmarks using TensorRT-LLM on a Mistral 7B Instruct v0.3 model. My hope was that at small batch sizes the overall inference latency should not be impacted as much,…