-
Hi @anas-awadalla
As described in #124, "Our training took place on 32 80GB A100s. We trained on 5M samples from MMC4 and 10M from LAION 2B."
I am interested in the details of loss during trai…
-
### 🐛 Bug description
**Command used**
CUDA_VISIBLE_DEVICES=2,3 accelerate launch --num_processes 2 path_to_train_m3e.py path_to_model path_to_dataset \
--output-dir output_dir
**Error message**
…
-
In the uneven sharding case, the local tensor on a rank that does not hold a full shard can be wrong: it uses the full shard's stride instead of accounting for the local tensor missing some elements from t…
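To see where the "missing elements" come from, here is a minimal sketch (the chunking rule is the usual ceil-division convention, assumed rather than taken from the issue) of how a length-10 dimension shards unevenly across 4 ranks, leaving the last rank with a smaller local tensor:

```python
# Uneven sharding sketch: a dim of size 10 split across 4 ranks with
# ceil-division chunks. The last rank holds fewer rows, so reusing the
# full shard's stride/shape there would be incorrect.
import math

dim, world = 10, 4
chunk = math.ceil(dim / world)  # 3 rows per full shard
local_sizes = [min(chunk, max(0, dim - r * chunk)) for r in range(world)]
print(local_sizes)  # [3, 3, 3, 1] -- rank 3 is short two rows
```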
awgu updated
8 months ago
-
I am using dlrover on Megatron-DeepSpeed, and my machine has 4 GPUs. The hybrid parallel settings are as follows:
TP:[0,1],[2,3]
DP:[0,2],[1,3]
At the same time, I also configured DeepSpeed with Zer…
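For reference, the TP/DP layout listed above follows the usual "adjacent ranks form a TP group, strided ranks form a DP group" convention. A minimal sketch deriving those groups (this is illustrative group arithmetic, not dlrover's or Megatron's actual code):

```python
# Derive TP/DP rank groups for 4 GPUs with tensor-parallel size 2.
WORLD_SIZE = 4
TP_SIZE = 2
DP_SIZE = WORLD_SIZE // TP_SIZE

# Adjacent ranks share a TP group; ranks with the same position
# inside their TP group form a DP group.
tp_groups = [list(range(i * TP_SIZE, (i + 1) * TP_SIZE)) for i in range(DP_SIZE)]
dp_groups = [list(range(j, WORLD_SIZE, TP_SIZE)) for j in range(TP_SIZE)]

print(tp_groups)  # [[0, 1], [2, 3]]
print(dp_groups)  # [[0, 2], [1, 3]]
```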
-
### 🚀 The feature, motivation and pitch
# RFC: PyTorch DistributedTensor
We have been developing a DistributedTensor (a.k.a DTensor) concept under the [pytorch/tau](https://github.com/pytorch/ta…
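As a rough intuition for what a DTensor placement means, here is an illustrative sketch only (plain Python, deliberately not the DTensor API) of how a `Shard(0)` placement partitions a global tensor's rows across a 1-D mesh of two devices:

```python
# Illustration of Shard(0) semantics: each rank in a 1-D mesh owns a
# contiguous row-slice of the logical global tensor.
global_rows = [[0, 1], [2, 3], [4, 5], [6, 7]]  # a 4x2 "tensor"
mesh = [0, 1]                                   # two device ranks
per_rank = len(global_rows) // len(mesh)
local = {r: global_rows[r * per_rank:(r + 1) * per_rank] for r in mesh}

print(local[0])  # [[0, 1], [2, 3]]
print(local[1])  # [[4, 5], [6, 7]]
```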
-
# Overview
Likely a race condition, leading to a crash when multiple GPUs (processes) are used and the output directory doesn't exist.
## Steps to reproduce
Run a multiple GPU job with `torchrun` and …
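The usual fix for this class of crash is to make directory creation idempotent, so that whichever rank loses the race is a no-op rather than an error. A sketch (the helper name is hypothetical; whether the repo creates the directory this way is an assumption):

```python
# Idempotent output-directory creation: safe when several torchrun
# processes reach this point at the same time.
import os
import tempfile

def ensure_output_dir(path: str) -> None:
    # exist_ok=True means a concurrent mkdir by another rank is harmless;
    # without it, the losing process raises FileExistsError.
    os.makedirs(path, exist_ok=True)

demo = os.path.join(tempfile.mkdtemp(), "out")
ensure_output_dir(demo)
ensure_output_dir(demo)  # a second (racing) call is a no-op, not an error
print(os.path.isdir(demo))  # True
```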
-
Hi, thanks for this amazing repo.
I was wondering how I should set the batch size to achieve a desired full (global) batch size.
For example, if I set train_dataset.huggingface_dataset.batch_size to 1 on TPUv3-…
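The usual convention (an assumption about this repo's semantics, worth confirming with the maintainers) is that the per-device setting is multiplied by the number of devices and any gradient-accumulation steps:

```python
# Global batch size under the common data-parallel convention:
# global = per_device * num_devices * grad_accum_steps
per_device_batch = 1
num_devices = 8      # illustrative device count, e.g. 8 TPU cores
grad_accum = 1
global_batch = per_device_batch * num_devices * grad_accum
print(global_batch)  # 8
```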
-
Looks like JAX used to do some "broadcasting" here, and no longer does. Bearing in mind that a PyTree may have arrays of multiple ranks, I'm not immediately sure what the appropriate fix is.
Taggin…
-
### 🐛 Describe the bug
When sharding a model using the `fully_shard` API, any custom parameter attributes set on the unsharded model are not copied over to the sharded model. This can be observed he…
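The likely mechanism (my reading of the report, illustrated here with a plain-Python stand-in rather than `torch.nn.Parameter`) is that sharding constructs fresh parameter objects, so ad-hoc Python attributes attached to the originals are silently dropped:

```python
# Stand-in class for a parameter object; fully_shard analogously builds
# new parameter objects and does not copy ad-hoc attributes across.
class Param:
    def __init__(self, data):
        self.data = data

p = Param([1.0, 2.0])
p.my_tag = "keep-me"            # custom attribute on the "unsharded" param

sharded = Param(p.data[:1])     # a new object built from the old data
print(hasattr(p, "my_tag"))       # True
print(hasattr(sharded, "my_tag")) # False -- the attribute was lost
```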
-
```
Traceback (most recent call last):
  File "E:\Stable Diffusion\stable-diffusion-webui-amdgpu-forge\launch.py", line 51, in <module>
    main()
  File "E:\Stable Diffusion\stable-diffusion-webui-amdgpu…
```