-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/OpenAccess-AI-Collective/axolotl/labels/bug) and didn't find any similar reports…
-
### 🚀 The feature, motivation and pitch
Is there a plan to add FP8 support for training?
### Alternatives
_No response_
### Additional context
_No response_
-
### 🐛 Describe the bug
I used FSDP + ShardedGradScaler to train my model. Compared with apex.amp + DDP, my model's accuracy has decreased.
The DDP setup looks like:
```
model, optimizer = amp.initial…
```
-
Hello, I saw in the paper that you trained on 32 GPUs during the pretraining stage. How can I do multi-node training with the Trainer's FSDP integration? For example, if I want to train on 16 A100s across 2 nodes, how should I set this up with the Trainer, and will the model be sharded across all 16 GPUs?
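One common way to launch such a run (a sketch; the script name, flags, rendezvous address, and port below are placeholders, not taken from the paper) is `torchrun` invoked once per node:

```shell
# Run on each of the 2 nodes; set --node_rank=0 on the first node and 1 on the second.
# train.py and <master-node-ip> are placeholders for illustration.
torchrun \
  --nnodes=2 \
  --nproc_per_node=8 \
  --node_rank=0 \
  --rdzv_backend=c10d \
  --rdzv_endpoint=<master-node-ip>:29500 \
  train.py
```

With full-shard FSDP, parameters are sharded across all 16 ranks, so each GPU holds roughly 1/16 of the model's parameter shards (activations and optimizer-state shards live alongside them).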
-
### 🚀 The feature, motivation and pitch
As of 1.12, only limited shared-parameter support exists for FSDP, i.e., shared parameters must be part of the same FSDP unit; users cannot share parameters if their respective…
-
### 🐛 Describe the bug
The model I want to train is more stable with EMA. I want to apply FSDP to the model so that I can train much larger model sizes, with code like the below:
```py…
-
### 🚀 The feature, motivation and pitch
A good profiling tool appears to be lacking for both DDP and FSDP.
### Alternatives
None.
### Additional context
Something like Horovod Timeline but bette…
-
## Work Items
* Meta-device initialization / `_apply()` methods
- [x] Support initial meta-device initialization using `swap_tensors` path
- [ ] Remove manual padding logic after https://github…
-
Notice: In order to resolve issues more efficiently, please raise issue following the template.
## ❓ Questions and Help
How does Funasr export ONNX for pre-tra…
-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/OpenAccess-AI-Collective/axolotl/labels/bug) and didn't find any similar reports…