🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.
28
stars
48
forks
source link
fix: avoid docker push timeout by splitting the single big layer into 2 smaller layers #331
Closed
HarikrishnanBalagopal closed 2 months ago
Description of the change
Split the 6GB layer into a two around 3GB layers
Related issue number
How to verify the PR
Was the PR tested