issues
search
aai-institute
/
continuiti
Learning function operators with neural networks.
GNU Lesser General Public License v3.0
21
stars
3
forks
source link
Parallel Trainer
#62
Closed
samuelburbulla
closed
7 months ago
samuelburbulla
commented
7 months ago
Feature: Parallel Trainer
Description
Which issue does this PR tackle?
Trainer does not support parallel (multi-GPU) training.
Fixes #61
How does it solve the problem?
Implements DDP.
How are the changes tested?
Added a parallel run script using test_trainer. Can not be tested in CI.
Checklist for Contributors
[x] Scope: This PR tackles exactly one problem.
[x] Conventions: The branch follows the
feature/title-slug
convention.
[x] Conventions: The PR title follows the
Bugfix: Title
convention.
[x] Coding style: The code passes all pre-commit hooks.
[x] Documentation: All changes are well-documented.
[x] Tests: New features are tested and all tests pass successfully.
[x] Changelog: Updated CHANGELOG.md for new features or breaking changes.
[x] Review: A suitable reviewer has been assigned.
Checklist for Reviewers:
[x] The PR solves the issue it claims to solve and only this one.
[x] Changes are tested sufficiently and all tests pass.
[x] Documentation is complete and well-written.
[x] Changelog has been updated, if necessary.
Feature: Parallel Trainer
Description
Which issue does this PR tackle?
How does it solve the problem?
How are the changes tested?
Checklist for Contributors
feature/title-slug
convention.Bugfix: Title
convention.Checklist for Reviewers: