mpi-cuda Search Results

1000+ results
for mpi-cuda

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

openucx/ucx #10085

Address Registration Error in CUDA Aware MPICH 4.2.2 + UCX 1…

I'm running an application on a cluster that uses CUDA Aware MPICH (v4.2.2) and UCX (v1.17.0). My application consists of two binaries, a server and a client, so I use the MPMD mode of `mpirun` to exe…

cl3to updated 2 weeks ago
7
nwchemgit/nwchem #1007

xtb in NWCHEM_MODULES not recognized

Maybe I don't know how to read, but it certainly seems like I put `xtb` in `NWCHEM_MODULES`, yet it's not recognized: ``` jehammond@oppenheimer:~/NWChem/github/src$ echo $NWCHEM_MODULES smallqm moi…

jeffhammond updated 2 months ago
1
nv-legate/legate.core #956

[BUG] Wrong result when summing on Pascal GPUs

### Software versions Python : 3.12.4 | packaged by conda-forge | (main, Jun 17 2024, 10:23:07) [GCC 12.3.0] Platform : Linux-5.4.0-169-generic-x86_64-with-glibc2.31 Legion : legi…

suranap updated 2 weeks ago
15
icl-utk-edu/slate #154

Segmentation fault in slate::gesv when using CUDA-aware MPI

**Description** I have been successfully running gesv using GPU-aware MPI on an AMD machine with HIP (Setonix @ Pawsey Supercomputing Centre Australia). But I am getting seg faults trying to do the s…

liamscarlett updated 3 months ago
2
NVIDIA/TensorRT-LLM #1959

Is MPI required even multi device is disabled?

### System Info - CPU x86_64 ### Who can help? _No response_ ### Information - [ ] The official example scripts - [ ] My own modified scripts ### Tasks - [ ] An officially supported task in the…

jlewi updated 4 weeks ago
5
open-mpi/ompi #8720

Derived types with CUDA-Aware MPI

## Background information ### What version of Open MPI are you using? (e.g., v3.0.5, v4.0.2, git branch name and hash, etc.) 4.0.5 shipped with Nvidia hpc_sdk 21.2 ### Please describe the sys…

ShatrovOA updated 2 years ago
1
kokkos/kokkos #3704

Initialization order of Kokkos, CUDA, MPI

Hi, We have a supercomputer with an Omni-Path network which requires that `cudaSetDevice` is called before `MPI_Init` for GPUDirect to work. Some of our users are using Kokkos on our facility an…

RemiLacroix-IDRIS updated 3 years ago
2
UCL-ARC/Grid #4

Profile test case

**DONE:** - [ ] profile of `tests/sp2n/Test_hmc_Sp_WilsonFundFermionGauge.cc` done and stored in shared folder - [x] `nsys` GPU - [ ] `ncu` GPU - [ ] CPU (vtune, muProf)

ilectra updated 4 days ago
1
intel/llvm #15251

[CUDA][HIP] too many process spawned on multiple GPU systems

### Describe the bug On multiple GPU systems, using HIP or CUDA, a process is spawned on all GPUs instead being spawned only on one of them. (See To reproduce section) This result in memory leak…

tdavidcl updated 1 month ago
8
NVIDIA/TensorRT-LLM #2240

Linear increase in latency with batch size

Hello, I am running some latency benchmarks using TensorRT-LLM on a Mistral 7B Instruct v0.3 model. My hope was that at small batch sizes the overall inference latency should not be impacted as much,…

mkserge updated 2 days ago
3

上一页 1...3 4 5 6 7 8 9...100 下一页

1000+ results for mpi-cuda

1000+ results
for mpi-cuda