-
### What happened + What you expected to happen
I can't start ray.
I instantiate a node in a slurm cluster using:
srun -n 1 --exclusive -G 1 --pty bash
This allocates a node with 112 cpus …
-
### Your name
Melissa Sulprizio
### Your affiliation
Harvard
### What happened? What did you expect to happen?
My latest GCHP integration tests for PR #2510 indicate all simulations passed:
``…
-
After running the model for a period of time, an error occurred: ERRENG = 1.0557652E-02 at i,j: 24 49
Net solar: 415.1313
Net longwave: 5511.5645
Total sensible…
-
just curious..why is `srun` needed here?
https://github.com/sabeenlohawala/tissue_labeling/blob/f88c8a08ccb72e2c4cde1867337b2d46f7e64556/submit_multi_gpu.sh#L20
-
Hi,
What is that `srun`? I don't see that file after make
```
[mahmood@rocks7 mpiBench]$ ls
crunch_mpiBench makefile mpiBench.c README.md
[mahmood@rocks7 mpiBench]$ make
rm -f mpiBench *.o
…
-
When submitting jobs on a GPU node, the generated batch job requests GRES that is not valid:
```
srun: error: Unable to create step for job 16413634: Invalid generic resource (gres) specification
…
-
Summarizing the situation here (Apr 16):
1. We can run pytorch_lightning with a single gpu as long as the strategy for trainer is "auto" (default) (without srun)
2. It fails when the strategy is "…
-
Good afternoon all
`glogin` (GWDG Emmy) has undergone some hardware and software upgrades recently. Since the upgrade, I find jobs launched with `srun` are considerably slower than jobs launched w…
-
With the project winding down, it is time to define a stable landing point where we can leave it for those wanting to use it. This means:
- removing all stale code, particularly components that are…
-
While trying to run an `e4s-cl init` I received an error that said it was an e4s-cl bug, and to report the contents of a debug file on github. Below is the pasted contents of the file:
```
$ cat /h…