-
preliminary tasks, focusing on those which do not require the HPC:
- [x] basic file handling (images, transcriptions, models):
  - [x] uploads
  - [ ] moving
  - [ ] sharing
  - [ ] exports
…
-
I might be off-topic, since this project states that it is for Nvidia and HPC-focused jobs.
I was just wondering if it was possible (not asking you guys to do it, but at least tell me if it is) to mo…
-
Dear All,
I would like to give this feedback on my HPC setup.
I have the controller and nodes working fine, the web interface is working fine, and I am able to create and manage users through the web interface, b…
-
**Describe the bug**
My team runs a Ruby script within an HPC (host process container). This works on containerd 1.6, but after upgrading to containerd 1.7, Ruby fails to start with the below error mes…
-
I reinstalled flash-attn with `pip install flash-attn==2.6.1` in the NGC PyTorch Docker image 24.06.
When I run a training job, I get the following error:
```
Traceback (most recent call last):
File "/data1/nfs15/nfs/bigdata/zha…
```
-
Hi,
I followed the update instructions detailed in #61, but I've now run into a new issue when importing...
```
---------------------------------------------------------------------------
ImportErro…
```
-
I am running into an issue when processing the test dataset while pulling a Singularity image with a Singularity profile and an additional Slurm profile:
`ERROR ~ Error executing process > 'SCATACPIPE:…
-
It would be nice to have a check on buildstock commands that depends on which HPC you are using. That is to say, a command like `buildstock_eagle` would not run if you are logged into Kestrel.
This…
-
Hi everyone, I found a bug while testing N10 LAMMPS in podman-hpc.
Image: localhost/n10-lammps:1.0
Run script:
```
podman-hpc run --gpu --mpi localhost/n10-lammps:1.0 /opt/lammps/install/bin/…
```
-
CX3-Pro cards are not supported in newer Mellanox OFED versions; they are supported through the Mellanox OFED LTS version (4.9-0.1.7.0) instead. For more information, see [Linux Drivers](https://www.m…