ray-project / ray_lightning

PyTorch Lightning Distributed Accelerators using Ray
Apache License 2.0

Bump pytorch-lightning from 1.5.9 to 1.6.1 #140

Closed: dependabot[bot] closed this pull request 2 years ago

dependabot[bot] commented 2 years ago

Bumps pytorch-lightning from 1.5.9 to 1.6.1.

Release notes

Sourced from pytorch-lightning's releases.

Standard weekly patch release

[1.6.1] - 2022-04-13

Changed

  • Support strategy argument being case insensitive (#12528), as illustrated in the sketch after the Fixed list below

Fixed

  • Run main progress bar updates independent of val progress bar updates in TQDMProgressBar (#12563)
  • Avoid calling average_parameters multiple times per optimizer step (#12452)
  • Properly pass some Logger's parent's arguments to super().__init__() (#12609)
  • Fixed an issue where incorrect type warnings appear when the overridden LightningLite.run method accepts user-defined arguments (#12629)
  • Fixed rank_zero_only decorator in LSF environments (#12587)
  • Don't raise a warning when nn.Module is not saved under hparams (#12669)
  • Raise MisconfigurationException when the accelerator is available but the user passes invalid ([]/0/"0") values to the devices flag (#12708)
  • Support auto_select_gpus with the accelerator and devices API (#12608)
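
The case-insensitive strategy change listed under Changed is easy to picture with a minimal sketch (not code from this PR; it assumes pytorch-lightning 1.6.1 is installed and only constructs the trainers, which does not launch any processes):

import pytorch_lightning as pl

# With 1.6.1 the strategy string is matched case-insensitively (#12528),
# so these two trainers are configured with the same DDP strategy.
# Processes are only spawned later, on trainer.fit().
trainer_a = pl.Trainer(accelerator="cpu", devices=2, strategy="ddp", max_epochs=1)
trainer_b = pl.Trainer(accelerator="cpu", devices=2, strategy="DDP", max_epochs=1)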

Contributors

@akihironitta @awaelchli @Borda @carmocca @kaushikb11 @krshrimali @mauvilsa @otaj @pre-commit-ci @rohitgr7 @semaphore-egg @tkonopka @wayi1

If we forgot someone due to not matching the commit email with the GitHub account, let us know :]

PyTorch Lightning 1.6: Support Intel's Habana Accelerator, New efficient DDP strategy (Bagua), Manual Fault-tolerance, Stability and Reliability.

The core team is excited to announce the PyTorch Lightning 1.6 release ⚡

Highlights

PyTorch Lightning 1.6 is the work of 99 contributors who have worked on features, bug-fixes, and documentation for a total of over 750 commits since 1.5. This is our most active release yet. Here are some highlights:

Introducing Intel's Habana Accelerator

Lightning 1.6 now supports the Habana® framework, which includes Gaudi® AI training processors. Their heterogeneous architecture includes a cluster of fully programmable Tensor Processing Cores (TPCs), a configurable Matrix Math engine, and the associated development tools and libraries.

You can leverage the Habana hardware to accelerate your Deep Learning training workloads simply by passing:

trainer = pl.Trainer(accelerator="hpu")

# single Gaudi training
trainer = pl.Trainer(accelerator="hpu", devices=1)
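
For context, here is a minimal end-to-end sketch of how such a trainer would be used. It assumes a Gaudi host with Habana's PyTorch integration installed; the TinyModel class and the random dataset are illustrative only and are not part of the release notes:

import torch
from torch.utils.data import DataLoader, TensorDataset
import pytorch_lightning as pl

class TinyModel(pl.LightningModule):
    def __init__(self):
        super().__init__()
        self.layer = torch.nn.Linear(32, 2)

    def training_step(self, batch, batch_idx):
        x, y = batch
        return torch.nn.functional.cross_entropy(self.layer(x), y)

    def configure_optimizers(self):
        return torch.optim.SGD(self.parameters(), lr=0.1)

# Random toy data; Lightning moves each batch to the HPU device during fit().
data = DataLoader(TensorDataset(torch.randn(64, 32), torch.randint(0, 2, (64,))), batch_size=8)
trainer = pl.Trainer(accelerator="hpu", devices=1, max_epochs=1)
trainer.fit(TinyModel(), data)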

... (truncated)

Changelog

Sourced from pytorch-lightning's changelog.

[1.6.1] - 2022-04-13

Changed

  • Support strategy argument being case insensitive (#12528)

Fixed

  • Run main progress bar updates independent of val progress bar updates in TQDMProgressBar (#12563)
  • Avoid calling average_parameters multiple times per optimizer step (#12452)
  • Properly pass some Logger's parent's arguments to super().__init__() (#12609)
  • Fixed an issue where incorrect type warnings appear when the overridden LightningLite.run method accepts user-defined arguments (#12629)
  • Fixed rank_zero_only decorator in LSF environments (#12587)
  • Don't raise a warning when nn.Module is not saved under hparams (#12669)
  • Raise MisconfigurationException when the accelerator is available but the user passes invalid ([]/0/"0") values to the devices flag (#12708)
  • Support auto_select_gpus with the accelerator and devices API (#12608)

[1.6.0] - 2022-03-29

Added

  • Allow logging to an existing run ID in MLflow with MLFlowLogger (#12290)
  • Enable gradient accumulation using Horovod's backward_passes_per_step (#11911)
  • Add new DETAIL log level to provide useful logs for improving monitoring and debugging of batch jobs (#11008)
  • Added a flag SLURMEnvironment(auto_requeue=True|False) to control whether Lightning handles the requeuing (#10601); see the sketch after this list
  • Fault Tolerant Manual
    • Add _Stateful protocol to detect if classes are stateful (#10646)
    • Add _FaultTolerantMode enum used to track different supported fault tolerant modes (#10645)
    • Add a _rotate_worker_indices utility to reload the state according to the latest worker (#10647)
    • Add stateful workers (#10674)
    • Add a utility to collect the states across processes (#10639)
    • Add logic to reload the states across data loading components (#10699)
    • Cleanup some fault tolerant utilities (#10703)
    • Enable Fault Tolerant Manual Training (#10707)
    • Broadcast the _terminate_gracefully to all processes and add support for DDP (#10638)
  • Added support for re-instantiation of custom (subclasses of) DataLoaders returned in the *_dataloader() methods, i.e., automatic replacement of samplers now works with custom types of DataLoader (#10680)
  • Added a function to validate if fault tolerant training is supported. (#10465)
  • Added a private callback to manage the creation and deletion of fault-tolerance checkpoints (#11862)
  • Show a better error message when a custom DataLoader implementation is not written correctly and Lightning needs to reconstruct it (#10719)
  • Show a better error message when frozen dataclass is used as a batch (#10927)
  • Save the Loop's state by default in the checkpoint (#10784)
  • Added Loop.replace to easily switch one loop for another (#10324)
  • Added support for --lr_scheduler=ReduceLROnPlateau to the LightningCLI (#10860)
  • Added LightningCLI.configure_optimizers to override the configure_optimizers return value (#10860)
  • Added LightningCLI(auto_registry) flag to register all subclasses of the registerable components automatically (#12108)
  • Added a warning that shows when max_epochs in the Trainer is not set (#10700)
  • Added support for returning a single Callback from LightningModule.configure_callbacks without wrapping it into a list (#11060)
  • Added console_kwargs for RichProgressBar to initialize inner Console (#10875)
  • Added support for shorthand notation to instantiate loggers with the LightningCLI (#11533)
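
To make the SLURMEnvironment(auto_requeue=...) flag mentioned in the list above concrete, here is a hedged sketch written against pytorch-lightning 1.6. The plugins= wiring and the 2-GPU SLURM allocation are assumptions for illustration, not part of the changelog:

from pytorch_lightning import Trainer
from pytorch_lightning.plugins.environments import SLURMEnvironment

# Opt out of Lightning's automatic SLURM requeuing; the surrounding sbatch
# script is then responsible for requeueing the job near its time limit.
trainer = Trainer(
    accelerator="gpu",
    devices=2,
    strategy="ddp",
    plugins=[SLURMEnvironment(auto_requeue=False)],
)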

... (truncated)

Commits
  • 0b0f1ec Update CHANGELOG.md
  • 2187f30 Support auto_select_gpus with accelerator and devices api (#12608)
  • d95447a Update deepspeed precision test (#12727)
  • 83e0c4a Raise MisconfigurationException when the accelerator is available but… (#12...
  • ba1e869 CI: check docker requires (#12677)
  • dfd9730 fix import failer (#12676)
  • 6aca974 Run main progress bar independent of val progress bar in TQDMProgressBar (#...
  • 23d3d46 Fix flaky test, that is not consistent on some configurations (#12707)
  • 8bc4a76 Update LightningCLI tests to reflect changes in jsonargparse 4.6.0 (#12704)
  • a4026fe Fix a typo in warning message inside Trainer.reset_train_dataloader (#12645)
  • Additional commits viewable in compare view


Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR:

  • `@dependabot rebase` will rebase this PR
  • `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it
  • `@dependabot merge` will merge this PR after your CI passes on it
  • `@dependabot squash and merge` will squash and merge this PR after your CI passes on it
  • `@dependabot cancel merge` will cancel a previously requested merge and block automerging
  • `@dependabot reopen` will reopen this PR if it is closed
  • `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
dependabot[bot] commented 2 years ago

Superseded by #146.