Modalities / modalities

Modalities, a PyTorch-native framework for distributed and reproducible foundation model training.
MIT License
59 stars 5 forks source link

Make Activation Checkpointing Configurable #159

Closed mali-git closed 3 months ago

mali-git commented 3 months ago

This PR addresses the issue described in Issue #32 by removing the hard-coded dependency for applying activation checkpointing specifically to GPT2Block. Instead, activation checkpointing is now made configurable through the configuration file, allowing greater flexibility and adaptability for different models.

General changes:

Breaking changes:

Checklist before submitting final PR

le1nux commented 3 months ago

LGTM :-) Nice work!

mali-git commented 3 months ago

we can still remove accelerate from the pytorch.toml as a dependency but besides that it looks great!

Done in 8605dbecbe46c068668f283bf4a6352c85ca1c21.