-
### Bug description
Hello,
I am using the `SLURMEnvironment` plugin to resubmit jobs automatically. So far it has been working seamlessly
on my academic cluster, but recently when the auto-requeue si…
-
I have been using the flowButton component on a home page to launch a screen flow. Starting in Winter '23, the button is not working correctly. The flow launches but does not get past a first scree…
-
### Bug description
Hi,
I am currently testing with IterableDataset and DDP.
Total Examples - ```10000```
Batch_size - ```32```
NUM_GPUS - ```2```
While using IterableDataset, ideally w…
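
For reference, here is a rough sketch of the batch counts these numbers imply per process. It covers two cases: the stream is split evenly across ranks, or every rank iterates the full stream (which can happen when an `IterableDataset` is not sharded by rank inside `__iter__`). The helper name `batches_per_rank` and the `sharded` flag are made up for illustration, and the sketch assumes `drop_last=False`.

```python
import math

# Values taken from the report above.
TOTAL_EXAMPLES = 10000
BATCH_SIZE = 32
NUM_GPUS = 2


def batches_per_rank(total_examples, batch_size, num_ranks, sharded):
    """Hypothetical helper: batches one process sees in one epoch.

    sharded=True  -> examples are split evenly across ranks.
    sharded=False -> every rank iterates the whole stream (duplicated data).
    Assumes the last partial batch is kept (drop_last=False).
    """
    per_rank = math.ceil(total_examples / num_ranks) if sharded else total_examples
    return math.ceil(per_rank / batch_size)


sharded_batches = batches_per_rank(TOTAL_EXAMPLES, BATCH_SIZE, NUM_GPUS, sharded=True)
unsharded_batches = batches_per_rank(TOTAL_EXAMPLES, BATCH_SIZE, NUM_GPUS, sharded=False)

print("batches per rank if sharded:    ", sharded_batches)
print("batches per rank if not sharded:", unsharded_batches)
```

Comparing the step count shown in the progress bar against these two numbers is a quick way to tell which case a run is in.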
-
Hi all, when running `openwebtext_trainer.py` with default settings, the program crashes after 6k steps with the following message:
```
(.venv) slkv@slkv-pc:~/Projects/project_brain$ python ./src/pret…
-
## _Who_ is the bug affecting?
Consumers of the [LightningWebChartJS](https://github.com/SalesforceLabs/LightningWebChartJS) repo who deploy its components individually through a custom deployment.
## _What_ is affecte…
-
Hello, when I am running the code
`from pts.model.tempflow import TempFlowEstimator
from pts.model.time_grad import TimeGradEstimator
from pts.model.transformer_tempflow import TransformerTempFlowE…
-
### Bug description
When using DeepSpeed, changes made to the checkpoint (adding or removing keys) in `on_save_checkpoint` are not preserved. After switching the strategy to `ddp`, the changes are saved as expected.
…
-
Currently, there is no ergonomic way for a test author to mock the behavior of an `@wire` dependency. A mechanism should be built so that the test environment can directly call or modify the state of …
-
### Bug description
When saving a checkpoint at `every_n_train_steps=3`, it performs the checkpoint saving [inside on_train_batch_end](https://github.com/Lightning-AI/lightning/blob/master/src/lightn…
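
To make the timing concrete, here is a small sketch of which global steps would trigger a save with `every_n_train_steps=3`. It assumes the callback uses a simple modulo check on the global step at the end of each training batch. The function `should_save` is a hypothetical stand-in written for this illustration, not the library's code.

```python
def should_save(global_step, every_n_train_steps):
    """Hypothetical stand-in for a step-based save condition.

    Assumes a save is triggered at the end of a training batch whenever
    the global step is a positive multiple of every_n_train_steps.
    """
    if every_n_train_steps < 1:
        return False  # step-based saving disabled
    return global_step > 0 and global_step % every_n_train_steps == 0


# Global steps 1..10, checked after each batch ends.
saved_steps = [step for step in range(1, 11) if should_save(step, 3)]
print(saved_steps)
```

Under that assumption, saves land right after steps 3, 6 and 9, so anything that runs later in the same step is not part of the saved state.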