-
python train.py \
> --dataset mpiigaze \
> --snapshot output/snapshots \
> --gpu 0 \
> --num_epochs 50 \
> --batch_size 16 \
> --lr 0.00001 \
> --arch ResNet101 \
> --alpha 1
Loadin…
-
**What would you like to be added**:
I'd like to support the MultiKueue for the plain pod the same as the Job and JobSet.
**Why is this needed**:
In general multi tenant clusters not for ML/HPC, we…
-
Platforms: rocm
This test was disabled because it is failing on main branch ([recent examples](https://torch-ci.com/failure?failureCaptures=%5B%22test_transformers.py%3A%3ATestSDPACudaOnlyCUDA%3A%3…
-
### System Info
Debian 11
`nvidia-smi`
```
+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 535.161.07 Driver Version: 535.16…
-
Definitions
-----------
All replicas will refer to the dataset (project) they are part of as one of their hubs.
TBD: Should we make the hub_ids field of each replica a list of entity references…
-
Hello!
The following currently errors out:
```
eqx.error_if((), jnp.array(False), "Errors?")
```
with the error:
```
.venv/lib/python3.11/site-packages/equinox/_errors.py:229: in error_if
…
-
### 🐛 Describe the bug
I was trying to run Bert model training on ICELAKE CPU with torch.compile mode then it is giving a value error, but when i am running it with eager mode then it is running fi…
-
Hello,
For some reason,my RNA-seq data(TPM and FPKM)came from different batches,so my question is:
1 Do I need remove batch effect when I use xcell
2 If needed ,Whether negative values are permi…
-
## Feature Request
New pub style analogous to fetch for pull based consumers to allow multiple messages for a stream to be published with a single call and all succeed or fail (effectively a transa…
-
What am I doing wrong? I did follow https://ufoai.org/wiki/Compile_for_Windows but didn't install mingw-w64-i686-googletest because it said it doesn't exist.
I downloaded msys2 from here https://www.…