-
### 🐛 Describe the bug
FSDP casts buffers for mixed precision, but it assigns through `buf.data`. We should avoid the `.data` usage by de-registering the old buffer and re-registering the low-precision …
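For illustration only, here is a minimal sketch (not FSDP's actual implementation) of re-registering buffers at low precision instead of writing through `.data`; the helper name `recast_buffers` and the chosen `dtype` are hypothetical:

```python
import torch
import torch.nn as nn

def recast_buffers(module: nn.Module, dtype: torch.dtype = torch.float16) -> None:
    """Replace each floating-point buffer with a low-precision copy.

    Sketch only: instead of mutating the tensor in place via buf.data,
    drop the old buffer and register a new one under the same name.
    (Persistence flags of non-persistent buffers are ignored here.)
    """
    for submodule in module.modules():
        # Snapshot the buffers so we can re-register while iterating.
        for name, buf in list(submodule.named_buffers(recurse=False)):
            if not buf.is_floating_point():
                continue
            # register_buffer with an existing name replaces the old entry,
            # so the module now owns a fresh low-precision tensor.
            submodule.register_buffer(name, buf.to(dtype))

# Usage sketch: BatchNorm keeps its running stats as buffers.
bn = nn.BatchNorm1d(8)
recast_buffers(bn, torch.float16)
print(bn.running_mean.dtype)  # torch.float16
```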
-
Hello,
I am looking at Lance for a PyTorch dataloader. I am having issues with a Lance-based loader (like this one: https://lancedb.github.io/lance/examples/llm_training.html) when using it in a di…
-
We were trying to fine-tune a MatFormer checkpoint (MatFormer-OLMo-180M, [Link](https://drive.google.com/drive/folders/1hI8wlHzQYRLfC4XdnS5Xl1vwV8S2UA0f?usp=sharing))
We used the following comma…
-
Overlapping the backward pass with the optimizer step is a classic idea, and PyTorch supports this overlap in DDP and FSDP. For example, here are the hooks in DDP: https://github.com/pytorch/pytorch/…
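As a rough illustration of the hook mechanism (not the linked code itself), a sketch of registering a DDP communication hook follows. It assumes the process group has already been initialized (e.g. under torchrun) with one GPU per rank; `allreduce_then_extra_work` is a hypothetical name:

```python
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP
from torch.distributed.algorithms.ddp_comm_hooks import default_hooks

# Assumes dist.init_process_group(...) has already run (e.g. via torchrun).
model = DDP(torch.nn.Linear(128, 128).cuda())

def allreduce_then_extra_work(state, bucket: dist.GradBucket) -> torch.futures.Future:
    # DDP invokes this once per gradient bucket during backward, so anything
    # chained onto the all-reduce future overlaps with the rest of backward.
    fut = default_hooks.allreduce_hook(state, bucket)

    def _after(fut: torch.futures.Future) -> torch.Tensor:
        grad = fut.value()
        # Per-bucket work (e.g. a partial optimizer step) could run here.
        return grad

    return fut.then(_after)

model.register_comm_hook(state=None, hook=allreduce_then_extra_work)
```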
-
Hi all,
I wanted to try to add support for multi-GPU training to allow fine-tuning of LLMs. I've already [opened an issue](https://github.com/lxuechen/private-transformers/issues/31) a few week…
-
Setup
- Environment: PyTorch 2.3.0, Composer 0.22.0, Streaming 0.7.4
- GPU: 8x H100 SXM, BF16 mode
This issue is related to #643 but concerns a more subtle issue with Streaming datasets. Over the cou…
-
### 🐛 Describe the bug
In multiprocessing mode (i.e. FSDP/DDP), JSONDecodeErrors occur within `torch._inductor.triton_heuristics.cached_autotune` if the filesystem does not lock the file itself.…
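To illustrate the failure mode rather than Inductor's actual cache code, here is a sketch of guarding a shared JSON cache with an explicit lock plus an atomic rename, assuming the third-party `filelock` package; the path and helper names are hypothetical:

```python
import json
import os
import tempfile
from filelock import FileLock  # third-party: pip install filelock

CACHE_PATH = "/shared/fs/autotune_cache.json"  # hypothetical shared path

def write_cache(entry: dict) -> None:
    # Serialize writers across ranks; without this, two workers can interleave
    # writes and a reader may see truncated JSON (hence JSONDecodeError).
    with FileLock(CACHE_PATH + ".lock"):
        data = {}
        if os.path.exists(CACHE_PATH):
            with open(CACHE_PATH) as f:
                data = json.load(f)
        data.update(entry)
        # Write to a temp file, then rename, so readers never see a partial file.
        fd, tmp = tempfile.mkstemp(dir=os.path.dirname(CACHE_PATH))
        with os.fdopen(fd, "w") as f:
            json.dump(data, f)
        os.replace(tmp, CACHE_PATH)

def read_cache() -> dict:
    with FileLock(CACHE_PATH + ".lock"):
        if not os.path.exists(CACHE_PATH):
            return {}
        with open(CACHE_PATH) as f:
            return json.load(f)
```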
-
### 🐛 Describe the bug
Since PT 2, we have noticed a significant amount of PCIe traffic between host and device, which is something we did not expect and did not observe in PT 1.x. This…
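One way to see where host<->device traffic comes from (a generic sketch, not the reporter's setup; the model and input are stand-ins) is to look for `Memcpy HtoD`/`Memcpy DtoH` entries in a profiler trace:

```python
import torch
from torch.profiler import profile, ProfilerActivity

# Stand-in model and input; the point is only to surface host<->device copies.
model = torch.nn.Linear(1024, 1024).cuda()
x = torch.randn(64, 1024, device="cuda")

with profile(activities=[ProfilerActivity.CPU, ProfilerActivity.CUDA]) as prof:
    model(x).sum().backward()

# Sort by CUDA time and scan for unexpected Memcpy entries.
print(prof.key_averages().table(sort_by="cuda_time_total", row_limit=20))
```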
-
with @cbalioglu
**Context**
Communication/computation overlap is a well-known theme in data-parallel training, where developers exploit any independence in the forward/backward/optimizer passes …
-
## Proposed refactor
Flatten the Strategy inheritance:
Part of #10416
### Motivation
Reduce coupling between strategies, reduce unintentional overrides/inheritance and avoid silent failures
…