Modalities / modalities

A framework for training multimodal foundation models.
MIT License

Towards stable modalities version #141

Closed le1nux closed 2 days ago

le1nux commented 1 month ago

This PR further stabilises the codebase and makes training more robust, in particular with respect to loss spikes, which we fixed in our experiments via scaled weight initialisation and an increased batch size. It also fixes all failing tests and adds a simple entrypoint for running CPU, single-GPU and multi-GPU tests. The PR consists of multiple sub-PRs.
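For readers unfamiliar with scaled weight initialisation, here is a minimal sketch of the idea. The description above does not say which scheme the PR uses, so this assumes a common GPT-2 style choice: projections that write into the residual stream are initialised with a standard deviation of `base_std / sqrt(2 * num_layers)`. The names `scaled_init_std` and `init_matrix`, the value `base_std = 0.02`, and the depth of 12 layers are all hypothetical and not taken from the modalities codebase.

```python
import math
import random


def scaled_init_std(base_std: float, num_layers: int) -> float:
    """Return base_std / sqrt(2 * num_layers), an assumed GPT-2 style scaling."""
    if num_layers < 1:
        raise ValueError("num_layers must be >= 1")
    return base_std / math.sqrt(2 * num_layers)


def init_matrix(rows: int, cols: int, std: float, rng: random.Random) -> list[list[float]]:
    """Sample a rows x cols matrix with entries drawn from N(0, std^2)."""
    return [[rng.gauss(0.0, std) for _ in range(cols)] for _ in range(rows)]


def empirical_std(matrix: list[list[float]]) -> float:
    """Population standard deviation over all entries of the matrix."""
    values = [v for row in matrix for v in row]
    mean = sum(values) / len(values)
    return math.sqrt(sum((v - mean) ** 2 for v in values) / len(values))


rng = random.Random(0)
base_std = 0.02   # assumed default, not taken from the PR
num_layers = 12   # hypothetical model depth

# Weights with the plain std versus the depth-scaled std.
plain = init_matrix(64, 64, base_std, rng)
scaled = init_matrix(64, 64, scaled_init_std(base_std, num_layers), rng)

print(f"plain std:  {empirical_std(plain):.5f}")
print(f"scaled std: {empirical_std(scaled):.5f}")
```

The intuition is that each layer adds its output to the residual stream, so shrinking the initial scale of those outputs with depth keeps the variance of the summed activations from growing with the number of layers, which is one plausible way to reduce early loss spikes.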

General changes:

Breaking changes:

Checklist before submitting final PR