mosaicml / llm-foundry

LLM training code for Databricks foundation models
https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm
Apache License 2.0
3.99k stars 525 forks source link

Using self.shift_labels instead of self.model.transformer.shift_label in the loss function. #1211

Closed ShashankMosaicML closed 4 months ago

ShashankMosaicML commented 4 months ago

Using self.shift_labels instead of self.model.transformer.shift_label in the loss function.