rasbt / gradient-accumulation-blog

Finetuning BLOOM on a single GPU using gradient-accumulation
https://sebastianraschka.com/blog/2023/llm-grad-accumulation.html
Apache License 2.0
24 stars 3 forks source link
ai bloom deeplearning llm pytorch transformer

gradient-accumulation-blog