mosaicml / llm-foundry

LLM training code for Databricks foundation models
https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm
Apache License 2.0
3.83k stars, 502 forks

Extendability refactors #1290

Closed dakinggg closed 1 week ago

dakinggg commented 2 weeks ago

This PR includes a few changes for increased extendability of the code:

Loss before and after:

[Image: Screenshot 2024-06-19 at 10 54 00 PM]

dakinggg commented 1 week ago

@milocress the GPU test failure is unrelated to this PR. It will be fixed by the next Composer release (which is why that test isn't marked as required yet).