IBM / dolomite-engine

Dolomite Engine is a library for pretraining/finetuning LLMs
Apache License 2.0
23 stars 7 forks source link

Average gradients across gradient accumulation steps #56

Closed mayank31398 closed 1 week ago