-
SchedMD released Slurm exporter support in the Slinky project: https://github.com/SlinkyProject/slurm-exporter
-
### Bug description
Would it be possible for Lightning to raise an error if `SLURM_NTASKS != SLURM_NTASKS_PER_NODE` in case both are set?
With a single node the current behavior is:
* `SLURM_NT…
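
A minimal sketch of the requested check, assuming the single-node case described above. The helper name `check_slurm_task_env` and its error message are hypothetical and are not part of Lightning's API; the sketch only compares the two environment variables when both are set.

```python
import os
from typing import Mapping, Optional


def check_slurm_task_env(env: Optional[Mapping[str, str]] = None) -> None:
    """Hypothetical helper: raise if both SLURM variables are set and disagree.

    Assumes a single-node job, where the two values are expected to match.
    """
    env = os.environ if env is None else env
    ntasks = env.get("SLURM_NTASKS")
    ntasks_per_node = env.get("SLURM_NTASKS_PER_NODE")

    # Only validate when both variables are set, as the issue requests.
    if ntasks is None or ntasks_per_node is None:
        return

    if int(ntasks) != int(ntasks_per_node):
        raise ValueError(
            f"SLURM_NTASKS={ntasks} does not match "
            f"SLURM_NTASKS_PER_NODE={ntasks_per_node}"
        )
```

Passing a plain dictionary instead of reading `os.environ` directly keeps the check easy to exercise outside a Slurm allocation.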
-
cat Bnanew_cactusv2.9.0_69.err
[2024-10-16T21:35:18+0800] [MainThread] [I] [toil.statsAndLogging] Enabling realtime logging in Toil
[2024-10-16T21:35:18+0800] [MainThread] [W] [toil.lib.humanize] De…
-
I have tried to submit a number of jobs using Slurm, and in every case the jobs stall, failing to produce any output. This holds for single-node jobs, multi-node jobs, realistic problems and simple te…
-
Potentially use the Submitit launcher plugin for Hydra: https://hydra.cc/docs/plugins/submitit_launcher/
-
## Context
Check https://github.com/ansys/pymapdl/pull/2865 for a bit of historical context, which led to https://github.com/ansys/pymapdl/pull/3091.
In #2865 we proposed implementing PyHPS to inte…
-
Hi,
When I run the code in the vignette "supervised_marker_map = MarkerMap(
adata.X.shape[1],
hidden_layer_size,
z_size,
len(adata.obs[group_by].unique()),
k,
loss_tra…
-
Following #1, and recent discussions, we should start thinking about implementing a SLURM client that is at least equivalent to the PBS implementation.
See https://slurm.schedmd.com/rosetta.pdf for…
-
## Summary
Slurm test is failing:
` /opt/outer/walkthrough-tests.sh: line 45: singularity: command not found`
## Additional Details
Likely this is caused by an earlier error:
`Could not…
-
### Bug Description
charmed-hpc should prevent jobs from exhausting all memory on the system by default.
1) Currently, job allocations can exhaust all available memory on the system. Set `MemSpe…