-
https://github.com/pytorch/torchtitan/pull/161/files#diff-80b04fce2b861d9470c6160853441793678ca13904dae2a9b8b7145f29cd017aR254
In principle, the issue is that the PP model code traced the non-F…
-
### 🚀 The feature, motivation and pitch
`_sharded_param_data` is still on meta while `sharded_param` has moved to cuda after calling `initialize_parameters()`.
The workaround is `model = model.to("cuda")`. b…
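A minimal sketch of the meta-to-CUDA materialization pattern involved here, with the report's blanket `.to("cuda")` workaround at the end; the `nn.Linear` stand-in is hypothetical:

```python
import torch
import torch.nn as nn

# Hypothetical stand-in for the real model, built on the meta device so no
# parameter storage is allocated yet.
with torch.device("meta"):
    model = nn.Linear(16, 16)
print(model.weight.device)  # meta

# Materialize storage on the GPU (values are uninitialized), then run the
# usual init so the parameters hold real data.
model = model.to_empty(device="cuda")
model.reset_parameters()

# The workaround from the report: a final blanket .to("cuda") moves any
# tensor the module still holds on meta onto the same device.
model = model.to("cuda")
print(model.weight.device)  # cuda:0
```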
-
-
A recent contribution to the pytorch_xla repo allows using FSDP in PyTorch XLA for sharding Module parameters across data-parallel workers. https://github.com/pytorch/xla/pull/3431
Some motivation be…
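A minimal usage sketch of the `XlaFullyShardedDataParallel` wrapper that PR introduces; the toy model, data, and hyperparameters are illustrative only:

```python
import torch
import torch.nn as nn
import torch_xla.core.xla_model as xm
from torch_xla.distributed.fsdp import XlaFullyShardedDataParallel as FSDP

# Toy model standing in for a real network; FSDP shards its parameters
# across the data-parallel workers.
device = xm.xla_device()
model = FSDP(nn.Linear(128, 10).to(device))
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)

inputs = torch.randn(8, 128, device=device)
targets = torch.randint(0, 10, (8,), device=device)

loss = nn.functional.cross_entropy(model(inputs), targets)
loss.backward()
# Call optimizer.step() directly; FSDP already reduces gradients across
# ranks, so xm.optimizer_step() would reduce them a second time.
optimizer.step()
xm.mark_step()
```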
-
#### What is your question?
I used the Python method from the FunASR documentation for exporting ONNX models to try to export the pretrained paraex-en-Streaming model to ONNX, but kept getting er…
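For reference, the export path described in the FunASR docs looks roughly like the sketch below; the `paraformer` model name is a placeholder for the streaming model in question, and the exact kwargs may differ by FunASR version:

```python
from funasr import AutoModel

# "paraformer" is a placeholder; substitute the pretrained streaming model
# named in the question.
model = AutoModel(model="paraformer", device="cpu")

# Export the model to ONNX; quantize=True would additionally emit a
# quantized variant.
res = model.export(quantize=False)
```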
-
I get the error below when I run the training cell in the Colab notebook FineTuning_colab.ipynb.
I also ran the "Training parameters" cell, and all parameters were parsed.
No LSB modules are available.
Description: Ubuntu 20.04.…
-
[torch-neuronx] FSDP support - Distributed Training on Trn1
-
### Root Cause
The root cause is a recent transformers update [to resolve high CPU usage for large quantized models](https://github.com/huggingface/transformers/pull/33154).
- what the PR doe…
-
## 🚀 Feature
FSDP should offer the option to flatten parameters by group, for instance flattening all biases separately from the other weights.
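For context, this grouping mirrors how optimizers already split parameters, e.g. decaying weights but not biases; flattening everything into one `FlatParameter` makes such per-group treatment harder. A minimal sketch of that existing optimizer-side pattern, with an illustrative toy model:

```python
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(64, 64), nn.ReLU(), nn.Linear(64, 10))

# Split parameters into biases vs. everything else: the same grouping the
# proposed per-group flattening would preserve under FSDP.
biases = [p for n, p in model.named_parameters() if n.endswith("bias")]
weights = [p for n, p in model.named_parameters() if not n.endswith("bias")]

optimizer = torch.optim.AdamW(
    [
        {"params": weights, "weight_decay": 0.01},
        {"params": biases, "weight_decay": 0.0},  # no decay on biases
    ],
    lr=1e-3,
)
```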
## Motivation
Following issue https://github.…
-
### 🚀 The feature, motivation and pitch
I'm currently optimizing the [Lightning reference implementation of LLaMA](https://github.com/Lightning-AI/lit-llama) (7B), although the following will be gene…