-
### Bug description
`FSDPStrategy.load_checkpoint` casts `checkpoint_path` to a `pathlib.Path` [here](https://github.com/Lightning-AI/lightning/blob/master/src/lightning/pytorch/strategies/fsdp.py#…
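Since the rest of the report is truncated here, the following is only a minimal sketch of why coercing a path-like string to `pathlib.Path` can be lossy, using a hypothetical `s3://` URI (not necessarily the exact case hit in this issue):

```python
from pathlib import Path

# Hypothetical remote checkpoint URI, used purely for illustration.
checkpoint_path = "s3://my-bucket/checkpoints/epoch=3.ckpt"

# Casting the URI to pathlib.Path collapses the "//" after the scheme,
# so the original location can no longer be recovered from the Path object.
as_path = Path(checkpoint_path)
print(as_path)  # s3:/my-bucket/checkpoints/epoch=3.ckpt
```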
-
For distributed recipes, such as full_finetune_distributed, the gradients end up getting synchronized after each backward() pass instead of only once before the optimizer step. This results in signifi…
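If the goal is to accumulate gradients and synchronize only once, FSDP1 exposes a `no_sync()` context manager that defers gradient communication until the final micro-batch. A minimal sketch, assuming `model` is already FSDP-wrapped and `micro_batches`/`loss_fn` stand in for the recipe's own objects:

```python
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

def accumulate_and_step(model: FSDP, optimizer, micro_batches, loss_fn):
    optimizer.zero_grad()
    *head, last = micro_batches
    # Skip gradient synchronization for all but the final micro-batch.
    with model.no_sync():
        for batch in head:
            loss_fn(model(batch)).backward()
    # Gradients are reduced across ranks only on this last backward().
    loss_fn(model(last)).backward()
    optimizer.step()
```

Note that under FSDP, `no_sync()` keeps unsharded gradients around between micro-batches, so it trades memory for the saved communication.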
-
Thank you for open-sourcing such meaningful work!
I ran into some problems during training.
When training with --use_fsdp, the saved model is incomplete, and the saved state_dict only contains partial visual t…
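In case it helps narrow things down, the usual pattern for getting a complete (unsharded) state dict out of FSDP1 before saving is roughly the sketch below; `model` is the FSDP-wrapped module and the output path is a placeholder:

```python
import torch
from torch.distributed.fsdp import (
    FullyShardedDataParallel as FSDP,
    StateDictType,
    FullStateDictConfig,
)

# Gather the full parameters onto rank 0 (on CPU) so the saved checkpoint
# contains every weight rather than only the local shard.
save_policy = FullStateDictConfig(offload_to_cpu=True, rank0_only=True)
with FSDP.state_dict_type(model, StateDictType.FULL_STATE_DICT, save_policy):
    state_dict = model.state_dict()

if torch.distributed.get_rank() == 0:
    torch.save(state_dict, "full_model.pt")  # placeholder path
```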
-
Are there any plans to support Gemma2 in torchtitan? I tried to use torchtitan to finetune the Gemma2 model, but got stuck on the following problem: how to parallelize the tied layers in the Gemma2 model? Maybe so…
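For context, "tied layer" here means the input embedding and the output projection share one weight tensor, roughly as in the toy sketch below (class and attribute names are illustrative, not Gemma2's actual module names), which is what makes it hard to shard the two modules independently:

```python
import torch.nn as nn

class TiedLM(nn.Module):
    """Toy model illustrating weight tying between embedding and output head."""

    def __init__(self, vocab_size: int, dim: int):
        super().__init__()
        self.tok_embeddings = nn.Embedding(vocab_size, dim)
        self.output = nn.Linear(dim, vocab_size, bias=False)
        # Tie the weights: both modules now reference the same parameter,
        # so any tensor-parallel plan must shard them consistently.
        self.output.weight = self.tok_embeddings.weight

    def forward(self, tokens):
        return self.output(self.tok_embeddings(tokens))
```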
-
## Description
As a user of prompt tuning, I want to be able to leverage multiple GPUs at train time!
## Discussion
Extends https://github.com/caikit/caikit-nlp/issues/175 to leverage PyTorch…
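As a concrete starting point for the discussion, a minimal multi-GPU prompt-tuning setup could look like the sketch below, built on Hugging Face `peft` and `accelerate`; the base model id, learning rate, and virtual-token count are placeholders rather than a proposal for caikit-nlp's actual API:

```python
import torch
from accelerate import Accelerator
from peft import PromptTuningConfig, TaskType, get_peft_model
from transformers import AutoModelForCausalLM

accelerator = Accelerator()  # run via `accelerate launch --multi_gpu train.py`

base = AutoModelForCausalLM.from_pretrained("bigscience/bloom-560m")  # placeholder
peft_config = PromptTuningConfig(task_type=TaskType.CAUSAL_LM, num_virtual_tokens=8)
model = get_peft_model(base, peft_config)  # only the prompt embeddings are trainable

optimizer = torch.optim.AdamW(model.parameters(), lr=3e-3)
model, optimizer = accelerator.prepare(model, optimizer)
# Prepare the dataloader the same way, then call accelerator.backward(loss)
# inside the training loop so gradients are handled per device.
```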
-
I am very interested in your project and appreciate all the work you have put into it.
But I ran into this bug with the project, please help me! @jph00 @johnowhitaker @KeremTurgutlu @warner-benjamin @geronimi73
World size: 2
Downlo…
-
Thank you guys for your work!
I was using FSDP + QLoRA to fine-tune Llama 3 70B on 8x A100 80G GPUs, and I encountered this error:
```shell
Traceback (most recent call last):
File "/mnt/209180/qis…
-
### 🐛 Describe the bug
I'm trying to follow the instructions to efficiently load Hugging Face models from [`torchtitan`'s docs for FSDP1 -> FSDP2: Meta-Device Initialization](https://github.com/pyt…
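For reference, the generic meta-device pattern (independent of the torchtitan-specific helpers; the model id and checkpoint path below are placeholders) is roughly:

```python
import torch
from transformers import AutoConfig, AutoModelForCausalLM

config = AutoConfig.from_pretrained("meta-llama/Llama-3.1-8B")  # placeholder id

# 1) Build the module structure on the meta device: no memory is allocated.
with torch.device("meta"):
    model = AutoModelForCausalLM.from_config(config)

# 2) Apply FSDP2's fully_shard(...) to the meta model here (omitted in this sketch).

# 3) Materialize real storage and fill it with the actual weights;
#    assign=True keeps the loaded tensors instead of copying into empty ones.
model.to_empty(device="cuda")
state_dict = torch.load("checkpoint.pt", map_location="cpu")  # placeholder path
model.load_state_dict(state_dict, assign=True)
```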
-
## Feature Request
Please support BF16 mixed-precision training.
## Additional context
Training with BF16 is usually more stable than with FP16, which is very important when training large models. Addit…
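For reference, in plain PyTorch a BF16 mixed-precision step is typically just autocast, and unlike FP16 it does not need a `GradScaler`; a minimal sketch with placeholder model/loss names:

```python
import torch

def train_step(model, batch, targets, loss_fn, optimizer):
    optimizer.zero_grad()
    # BF16 keeps FP32's exponent range, so loss scaling is not required.
    with torch.autocast(device_type="cuda", dtype=torch.bfloat16):
        loss = loss_fn(model(batch), targets)
    loss.backward()
    optimizer.step()
    return loss.detach()
```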
-
## 🐛 Bug
Related https://github.com/PyTorchLightning/pytorch-lightning/pull/6152
When wrapping the module twice in FSDP, because we introduce a `FlattenParamsWrapper` that contains all the param…
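To make the setup concrete, the double wrapping in question looks roughly like the sketch below, using fairscale's FSDP (where the referenced `FlattenParamsWrapper` lives); it assumes `torch.distributed` is already initialized, and the inner module is a placeholder:

```python
import torch.nn as nn
from fairscale.nn.data_parallel import FullyShardedDataParallel as FSDP

inner = nn.Linear(32, 32)  # placeholder module

# Double wrapping: the outer FSDP only sees the inner wrapper's single
# flattened parameter, not the original module's individual parameters.
model = FSDP(FSDP(inner))
```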