ray-project / ray_lightning

PyTorch Lightning Distributed Accelerators using Ray
Apache License 2.0

Bump pytorch-lightning from 1.5.9 to 1.6.1 #140

Closed: dependabot[bot] closed this pull request 2 years ago

dependabot[bot] commented 2 years ago

Bumps pytorch-lightning from 1.5.9 to 1.6.1.

Release notes

Sourced from pytorch-lightning's releases.

Standard weekly patch release

[1.6.1] - 2022-04-13

Changed

  • Support strategy argument being case insensitive (#12528), as illustrated in the sketch after the Fixed list below

Fixed

  • Run main progress bar updates independent of val progress bar updates in TQDMProgressBar (#12563)
  • Avoid calling average_parameters multiple times per optimizer step (#12452)
  • Properly pass some Logger's parent's arguments to super().__init__() (#12609)
  • Fixed an issue where incorrect type warnings appear when the overridden LightningLite.run method accepts user-defined arguments (#12629)
  • Fixed rank_zero_only decorator in LSF environments (#12587)
  • Don't raise a warning when nn.Module is not saved under hparams (#12669)
  • Raise MisconfigurationException when the accelerator is available but the user passes invalid ([]/0/"0") values to the devices flag (#12708)
  • Support auto_select_gpus with the accelerator and devices API (#12608)
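
The case-insensitive strategy change listed under Changed is easy to picture with a minimal sketch (not code from this PR; it assumes pytorch-lightning 1.6.1 is installed and only constructs the trainers, which does not launch any processes):

import pytorch_lightning as pl

# With 1.6.1 the strategy string is matched case-insensitively (#12528),
# so these two trainers are configured with the same DDP strategy.
# Processes are only spawned later, on trainer.fit().
trainer_a = pl.Trainer(accelerator="cpu", devices=2, strategy="ddp", max_epochs=1)
trainer_b = pl.Trainer(accelerator="cpu", devices=2, strategy="DDP", max_epochs=1)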

Contributors

@akihironitta @awaelchli @Borda @carmocca @kaushikb11 @krshrimali @mauvilsa @otaj @pre-commit-ci @rohitgr7 @semaphore-egg @tkonopka @wayi1

If we forgot someone due to not matching the commit email with the GitHub account, let us know :]

PyTorch Lightning 1.6: Support Intel's Habana Accelerator, New efficient DDP strategy (Bagua), Manual Fault-tolerance, Stability and Reliability.

The core team is excited to announce the PyTorch Lightning 1.6 release ⚡

Highlights

PyTorch Lightning 1.6 is the work of 99 contributors who have worked on features, bug-fixes, and documentation for a total of over 750 commits since 1.5. This is our most active release yet. Here are some highlights:

Introducing Intel's Habana Accelerator

Lightning 1.6 now supports the Habana® framework, which includes Gaudi® AI training processors. Their heterogeneous architecture includes a cluster of fully programmable Tensor Processing Cores (TPCs), a configurable Matrix Math engine, and the associated development tools and libraries.

You can leverage the Habana hardware to accelerate your Deep Learning training workloads simply by passing:

trainer = pl.Trainer(accelerator="hpu")

# single Gaudi training
trainer = pl.Trainer(accelerator="hpu", devices=1)
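
For context, here is a minimal end-to-end sketch of how such a trainer would be used. It assumes a Gaudi host with Habana's PyTorch integration installed; the TinyModel class and the random dataset are illustrative only and are not part of the release notes:

import torch
from torch.utils.data import DataLoader, TensorDataset
import pytorch_lightning as pl

class TinyModel(pl.LightningModule):
    def __init__(self):
        super().__init__()
        self.layer = torch.nn.Linear(32, 2)

    def training_step(self, batch, batch_idx):
        x, y = batch
        return torch.nn.functional.cross_entropy(self.layer(x), y)

    def configure_optimizers(self):
        return torch.optim.SGD(self.parameters(), lr=0.1)

# Random toy data; Lightning moves each batch to the HPU device during fit().
data = DataLoader(TensorDataset(torch.randn(64, 32), torch.randint(0, 2, (64,))), batch_size=8)
trainer = pl.Trainer(accelerator="hpu", devices=1, max_epochs=1)
trainer.fit(TinyModel(), data)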

... (truncated)

Changelog

Sourced from pytorch-lightning's changelog.

[1.6.1] - 2022-04-13

Changed

  • Support strategy argument being case insensitive (#12528)

Fixed

  • Run main progress bar updates independent of val progress bar updates in TQDMProgressBar (#12563)
  • Avoid calling average_parameters multiple times per optimizer step (#12452)
  • Properly pass some Logger's parent's arguments to super().__init__() (#12609)
  • Fixed an issue where incorrect type warnings appear when the overridden LightningLite.run method accepts user-defined arguments (#12629)
  • Fixed rank_zero_only decorator in LSF environments (#12587)
  • Don't raise a warning when nn.Module is not saved under hparams (#12669)
  • Raise MisconfigurationException when the accelerator is available but the user passes invalid ([]/0/"0") values to the devices flag (#12708)
  • Support auto_select_gpus with the accelerator and devices API (#12608)

[1.6.0] - 2022-03-29

Added

  • Allow logging to an existing run ID in MLflow with MLFlowLogger (#12290)
  • Enable gradient accumulation using Horovod's backward_passes_per_step (#11911)
  • Add new DETAIL log level to provide useful logs for improving monitoring and debugging of batch jobs (#11008)
  • Added a flag SLURMEnvironment(auto_requeue=True|False) to control whether Lightning handles the requeuing (#10601); see the sketch after this list
  • Fault Tolerant Manual
    • Add _Stateful protocol to detect if classes are stateful (#10646)
    • Add _FaultTolerantMode enum used to track different supported fault tolerant modes (#10645)
    • Add a _rotate_worker_indices utility to reload the state according to the latest worker (#10647)
    • Add stateful workers (#10674)
    • Add a utility to collect the states across processes (#10639)
    • Add logic to reload the states across data loading components (#10699)
    • Cleanup some fault tolerant utilities (#10703)
    • Enable Fault Tolerant Manual Training (#10707)
    • Broadcast the _terminate_gracefully to all processes and add support for DDP (#10638)
  • Added support for re-instantiation of custom (subclasses of) DataLoaders returned in the *_dataloader() methods, i.e., automatic replacement of samplers now works with custom types of DataLoader (#10680)
  • Added a function to validate if fault tolerant training is supported. (#10465)
  • Added a private callback to manage the creation and deletion of fault-tolerance checkpoints (#11862)
  • Show a better error message when a custom DataLoader implementation is not written correctly and Lightning needs to reconstruct it (#10719)
  • Show a better error message when frozen dataclass is used as a batch (#10927)
  • Save the Loop's state by default in the checkpoint (#10784)
  • Added Loop.replace to easily switch one loop for another (#10324)
  • Added support for --lr_scheduler=ReduceLROnPlateau to the LightningCLI (#10860)
  • Added LightningCLI.configure_optimizers to override the configure_optimizers return value (#10860)
  • Added LightningCLI(auto_registry) flag to register all subclasses of the registerable components automatically (#12108)
  • Added a warning that shows when max_epochs in the Trainer is not set (#10700)
  • Added support for returning a single Callback from LightningModule.configure_callbacks without wrapping it into a list (#11060)
  • Added console_kwargs for RichProgressBar to initialize inner Console (#10875)
  • Added support for shorthand notation to instantiate loggers with the LightningCLI (#11533)
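
To make the SLURMEnvironment(auto_requeue=...) flag mentioned in the list above concrete, here is a hedged sketch written against pytorch-lightning 1.6. The plugins= wiring and the 2-GPU SLURM allocation are assumptions for illustration, not part of the changelog:

from pytorch_lightning import Trainer
from pytorch_lightning.plugins.environments import SLURMEnvironment

# Opt out of Lightning's automatic SLURM requeuing; the surrounding sbatch
# script is then responsible for requeueing the job near its time limit.
trainer = Trainer(
    accelerator="gpu",
    devices=2,
    strategy="ddp",
    plugins=[SLURMEnvironment(auto_requeue=False)],
)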

... (truncated)

Commits
  • 0b0f1ec Update CHANGELOG.md
  • 2187f30 Support auto_select_gpus with accelerator and devices api (#12608)
  • d95447a Update deepspeed precision test (#12727)
  • 83e0c4a Raise MisconfigurationException when the accelerator is available but… (#12...
  • ba1e869 CI: check docker requires (#12677)
  • dfd9730 fix import failer (#12676)
  • 6aca974 Run main progress bar independent of val progress bar in TQDMProgressBar (#...
  • 23d3d46 Fix flaky test, that is not consistent on some configurations (#12707)
  • 8bc4a76 Update LightningCLI tests to reflect changes in jsonargparse 4.6.0 (#12704)
  • a4026fe Fix a typo in warning message inside Trainer.reset_train_dataloader (#12645)
  • Additional commits viewable in compare view


Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR:

  • `@dependabot rebase` will rebase this PR
  • `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it
  • `@dependabot merge` will merge this PR after your CI passes on it
  • `@dependabot squash and merge` will squash and merge this PR after your CI passes on it
  • `@dependabot cancel merge` will cancel a previously requested merge and block automerging
  • `@dependabot reopen` will reopen this PR if it is closed
  • `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
dependabot[bot] commented 2 years ago

Superseded by #146.