fsdp Search Results - Githubissues

1000+ results
for fsdp

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

pytorch/pytorch #135267

First all_to_all_single / barrier takes a large amount of ti…

### 🐛 Describe the bug I'm getting profiles like this: The first `all_to_all_single` / `barrier` are taking large amounts of time (and invoking thousands of `cudaMemcpyAsync`, `cudaGetDevice…

vedantroy updated 1 month ago
9
lllyasviel/ControlNet #319

Why `pl.Trainer` can not handle multi-gpu case?

I can run the original `tutorial_train.py` with single 3090Ti GPU (24G) with batch_size 3. However, when upgrade to 2 or more gpus, it keep warning OOM. ``` trainer = pl.Trainer(gpus=2 precision=…

doem97 updated 1 year ago
6
yxli2123/LoftQ #17

loftQ can not use multi gpu to train

When I set: import os os.environ['CUDA_VISIBLE_DEVICES'] = '0,1,2,3' will raise error : ../aten/src/ATen/native/cuda/Indexing.cu:1146: indexSelectLargeIndex: block: [42,0,0], thread: [64,0,0] Ass…

WanBenLe updated 4 months ago
9
CompVis/stable-diffusion #180

Leverage deepspeed for significantly faster inference

https://github.com/microsoft/DeepSpeed Deepspeed is a state of the art library that enable out of the box many optimizations for inference. While especially good for clusters, I believe it can bring…

LifeIsStrange updated 1 year ago
1
invoke-ai/InvokeAI #6961

[bug]: [SOLVED]

### Is there an existing issue for this problem? - [X] I have searched the existing issues ### Operating system Windows ### GPU vendor Nvidia (CUDA) ### GPU model RTX 4090 ### GPU VRAM 24 ##…

paulerbear updated 1 week ago
4
facebookresearch/fairseq #3577

AttributeError: 'dict' object has no attribute 'replace' the…

## 🐛 Bug Cant load wav2vec checkpoint as described here https://github.com/pytorch/fairseq/blob/master/examples/wav2vec/README.md ### To Reproduce Run this colab https://colab.research.googl…

hadaev8 updated 3 years ago
1
axolotl-ai-cloud/axolotl #1325

Mamba example config fails on latest docker

### Please check that this issue hasn't been reported before. - [X] I searched previous [Bug Reports](https://github.com/OpenAccess-AI-Collective/axolotl/labels/bug) didn't find any similar reports. …

Layoric updated 5 months ago
5
alpa-projects/alpa #868

Will OPT-IML-175B be supported?

**System information** - Alpa version: 0.2.2 - Are you willing to contribute it (Yes/No): No. **Describe the new feature and the current behavior/state** Alpa supports OPT-175B currently. But t…

JingfengYang updated 1 year ago
8
bryandlee/Tune-A-Video #2

CUDA out of memory Error

![image](https://user-images.githubusercontent.com/20476674/212067731-50506295-9e27-41f3-ab25-558ade9e5fbb.png) It seems that the 32G GPU is not enough. How large memory a GPU is needed for normal op…

westfish updated 1 year ago
4
junjie18/CMT #27

performance on waymo dataset

hi authors, I am curious about the performance of the model on waymo dataset, but this was not mentioned in the paper. May I ask if you have conducted any relevant experiments and what were the res…

lcc815 updated 1 year ago
3

上一页 1...84 85 86 87 88 89 90...100 下一页

1000+ results for fsdp

1000+ results
for fsdp