Modalities / modalities

A framework for training multimodal foundation models.
MIT License

Fix: Computation of Total Number of Parameters #186

Closed: mali-git closed this 1 month ago

mali-git commented 1 month ago

What does this PR do?

This PR fixes the computation of the total number of parameters in a (sharded) model. Previously, when the model was sharded, the reported parameter count only reflected the number of parameters in a single shard.
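To illustrate the difference, here is a minimal sketch, not the code from this PR. It simulates the ranks in plain Python: each rank only sees the shapes of its local shard, and the total is the sum of the local counts over all ranks. In a real distributed run that sum would typically come from a sum-reduction across ranks (for example `torch.distributed.all_reduce`). The helper names, the example shapes, and the even flat split across 4 ranks are all assumptions made for illustration.

```python
from math import prod


def local_param_count(shard_shapes):
    """Number of parameters held by one rank, given the shapes of its local tensors."""
    return sum(prod(shape) for shape in shard_shapes)


def total_param_count(shapes_per_rank):
    """Sum of the local counts over all ranks (what a sum-reduction would yield)."""
    return sum(local_param_count(shapes) for shapes in shapes_per_rank)


# Hypothetical unsharded model: one 1024x512 weight and one 512-element bias.
full_shapes = [(1024, 512), (512,)]
world_size = 4  # assumed number of ranks

# Assumption: each parameter is flattened and split evenly across the ranks.
shapes_per_rank = [
    [(prod(shape) // world_size,) for shape in full_shapes]
    for _ in range(world_size)
]

single_shard = local_param_count(shapes_per_rank[0])  # what one rank sees locally
total = total_param_count(shapes_per_rank)            # count over all shards

print(f"single shard: {single_shard}, total: {total}")
```

Counting on a single rank gives only a quarter of the parameters here, while summing over all ranks recovers the size of the unsharded model.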

General Changes

Breaking Changes

Checklist before submitting final PR