-
Hello,
We are currently using Toil python API run a series of molecular simulations (within a python script) on a Slurm cluster. The python script is submitted through `sbatch` which specifies a max…
-
Thanks for the great work!
Could you provide more details on how to set the configs to run experiments on different growth operators as described in the paper (other than G_stack)?
-
To run MPMD I use the classic form: `mpirun -n 4 exe1 : -n 4 exe2 : ...`.
I normally split the world communicator into a communicator for each executable using `MPI_APPNUM` (Fortran):
```fortran
…
b-fg updated
8 months ago
-
Dear Support,
I am using opnempi version 4.1.5 installed using spack
[root@login-cluster-1 hello-world]# mpirun --version
mpirun (Open MPI) 4.1.5
If i run a simple hello using mpirun everyth…
-
## Rationale
Some cluster setups require the use of `srun` or special `mpirun` to handle executions. `MOOSE_MPI_COMMAND` was implemented way back when to handle this for `RunApp` way back when (htt…
-
I created an image which contains a conda environment where I installed some packages from github.
When I run the image with docker everything is ok, the packages are all there.
![Image](https://gith…
-
In intra-node collective communication works well via NCCL(H100 HGX server with NVswitch), but we encountered below error in terms of infiniband device error for inter-node communication(GPU Direct RD…
-
## 🐛 Bug Report
### How To Reproduce
Steps to reproduce the behavior:
1. Go to '...'
1. Run '....'
3. ...
srun: launch/slurm: launch_p_step_launch: CpuBindType=(null type)
srun: launchi…
-
### What would you like to report?
This issue was raised from this discussion: https://github.com/Quantum-Accelerators/quacc/discussions/2491
The code to reproduce this issue:
```
from jobflow_r…
-
I would like to use Enroot containers to provide toolchain environments for Slurm, i.e. as a sort of substitute for lmod modules. A typical example are NVIDIA Container images which can contain source…