-
0:00 - Introduction
0:42 - Sponsors & Contact information
1:31 - Transformers from the ground up
2:37 - Examples for transformers
4:48 - Outline
6:29 - Disclaimer
7:11 - Augmenting RNNs with at…
-
Dear All,
I would like to run ALIGNN on multiple GPUs. When I checked the code, I could not find any option for this.
Is there any way to run ALIGNN on multiple GPUs, such as using PyTorch Lightning or DDP fu…
-
Just wanted to report a crash while training.
**Error message:** `[process exited with code 1 (0x00000001)]`
**Command I used to start the process:** `ACCELERATE_LOG_LEVEL=info accelerate launch…
-
Hello,
Thank you for sharing your work!
I noticed that `finetune.sh` and `finetune_fsdp.sh` use regular training by default. Should I change it to `zo` to enable the MeZO trainer? Also, I'm gett…
-
### Feature request
Some of our models interpolate their positional embeddings, enabling pretrained checkpoints to be used on different input resolutions. For example, [here in ViT](https://github.co…
-
### What happened + What you expected to happen
I converted existing code that worked on 2.7 to 2.20 (new API).
The error:
File "/opt/project/trading/training/model/rl/multi_agent/ppo/equity/trainer…
-
## 🐛 Bug
We're seeing a non-deterministic error that occurs during a PyTorch Lightning training run when we use a remote AIM repo for logging (i.e. setting `repo="aim://our-aim-server:53800/"` when initi…
-
Howdy folks,
Say we trained a GP on data with some known standard deviation `d1`. Then we deploy this GP in the world, but with sensors that might be noisier than the one from which the training da…
-
**Describe the bug**
Hi, authors. My code seems to hang when `skip_remainder_batch=False`.
**To Reproduce**
Steps to reproduce the behavior:
```
git clone https://github.com/microsoft/tutel --b…
-
Hi, I ran into the following issue. Thank you.
`(venv) root@autodl-container-8a50119a52-f09cc96a:~/autodl-tmp/cvt2distilgpt2# dlhpcstarter -t iu_x-ray -c config/train_iu_x_ray_chen_cvt2distilgpt2.yaml --stages_mo…