-
### 🐛 Describe the bug
When running inference on the [Lama inpainting model](https://github.com/advimman/lama) under PT2 using the supplied [predict.py](https://github.com/advimman/lama/blob/main/b…
-
cpu/gpu memory is enough to do this job,i run the model gpt-neo-125M in a nvidia 3090-24g and cpu memory is 64g.
system
`win10 with a nvidia 3090-24g and cpu memory is 64g.`
docker image
```
REP…
-
### 🐛 Describe the bug
DistributedDataParallel doesn't work with complex buffers, even when `broadcast_buffers=False`.
```py
import os
import torch
from torch import nn
torch.distributed.i…
-
### 🐛 Describe the bug
I tried to implement the `causal_lower_right` masking in flex attention. This requires the masking function to know the difference in lengths of keys and queries:
```python
…
-
### Prerequisite
- [X] I have searched [Issues](https://github.com/open-mmlab/mmcv/issues) and [Discussions](https://github.com/open-mmlab/mmcv/discussions) but cannot get the expected help.
- [X]…
-
Thanks for reaching out! I like the direction, IMO it's important to have unknown-length types for two reasons:
1) to allow using all of SSE4/AVX2/AVX-512 without source code changes;
2) to enable u…
penzn updated
3 years ago
-
If this is a build issue, please fill out the template below.
### System information
* Operating system: OSX version 10.13.2 (Macbook pro 11,3)
CUDA/nvcc version 9.1
CuDNN version 7.0.5
whe…
-
### 🐛 Describe the bug
Running
```
import torch
cfn = torch.compile(torch.sin, mode='reduce-overhead')
cfn2 = torch.compile(torch.cos, mode='reduce-overhead')
for _ in range(2):
x = cfn2(to…
-
> NOTE: Remember to label this issue with "`ci: sev`"
## Current Status
ongoing
## Error looks like
[CI result page](https://hud.pytorch.org/pytorch/pytorch/pull/136069?sha=546634dec51…
-
### 🐛 Describe the bug
Reproduced below, `dist.barrier()` fails after calls to `torch.distributed.checkpoint.async_save`.
Interestingly enough, this does not happen if we first call `all_reduce…