foundation-model-stack / fms-hf-tuning

🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.
Apache License 2.0
28 stars 48 forks source link

fix: avoid docker push timeout by splitting the single big layer into 2 smaller layers #331

Closed HarikrishnanBalagopal closed 2 months ago

HarikrishnanBalagopal commented 2 months ago

Description of the change

Split the 6GB layer into a two around 3GB layers

Related issue number

How to verify the PR

Was the PR tested

kmehant commented 2 months ago

cherry picked