-
the tutorial is broken at the moment as it pins to aws-parallelcluster==2.10.1
which requires Slurm 20.02.4 as per https://docs.aws.amazon.com/parallelcluster/latest/ug/schedulers.slurm.html
whi…
-
The Slurm_lapply function fails, reporting job ids of NA, when run on a federated SLURM cluster. In this case the parallel slurm jobs were successfully started, but the parent process failed to parse …
-
### 🐛 Describe the bug
I am running librispeech recipe with distributed mode using slurm on esonet2. i am running on two oracle instance each one has single gpu (Tesla V100). but when i ran stage 11 …
-
OrthoFinder version 2.5.5 Copyright (C) 2014 David Emms
2024-06-02 15:53:19 : Starting OrthoFinder 2.5.5
64 thread(s) for highly parallel tasks (BLAST searches etc.)
8 thread(s) for OrthoFinder a…
-
如果和Slurm Power Saving配置不一样,请告知节能具体怎么配置。
-
# 记一次Slurm集群部署 | Broken Memories
记一次Slurm集群环境部署 工作中遇到了好几次基于Slurm部署HPC计算集群的工作。因此在这里……
[https://yuukisama.cc/posts/%E8%AE%B0%E4%B8%80%E6%AC%A1slurm%E9%9B%86%E7%BE%A4%E9%83%A8%E7%BD%B2/](https://yuukis…
-
### Bug description
Hello! When I train with DDP strategy, any type of crashes like `Out Of Memory (OOM)` error or `scancel` slurm job results in slurm nodes to drain due to `Kill task failed` which …
-
Hello,
When I call scancel to cancel all tasks submitted to a slurm cluster by future.batchtools then hit CTRL-C to get the terminal back, R displays an error per task and is painfully slow to come b…
-
Hello,
I am supporting a user @andresp-wave at our HPC center who is using qopen. He is allocating 32 CPUs through Slurm and setting the the njobs in qopen config file to either null or to 32. On h…
-
Hello and apologies if this question is in the wrong place. We are upgrading from Debian 8 to Debian 11. I am a developer with no particular background in system administration or configuration. Se…