-
### 🐛 Describe the bug
The batched GEMM has a poor performance for bigger batch size(`12*7*120*64*129`) with smaller matrix size(`3x3`, `3x1`):
```python
import torch
import time
points = t…
-
## Issue description
I'm using PyTorch DDP, DeepSpeed ZeRO, and PyTorch FSDP to train my model that contains several `nn.Conv1d` layers. `Conv1d.forward` eventually calls `aten::cudnn_convolution` …
-
Hello.
I have a problem after I trained your sample images, which you did upload in baidu.
And I apply the mat file in your testSRnet_result.m after the training and,
I also apply the mat file, you…
-
### 🐛 Describe the bug
I am trying to quantize a yolov5 model trained on a custom dataset using pytorch static quantization.
However I am having a bug with this function :
model_prepared = qu…
-
## ❓ Questions and Help
Hi there! I've finally been able to get a Mask R-CNN to train but unfortunately the results are not great. This is probably due to the fact that I'm using medical imaging da…
-
### 🐛 Describe the bug
RuntimeError: "replication_pad2d_cuda" not implemented for 'BFloat16'
### Versions
```
Collecting environment information...
PyTorch version: 2.1.2+cu121
Is debug build: F…
-
### 🐛 Describe the bug
I noticed that `torch.linalg.lstsq` is returning the wrong results for this sample which I attached here for reproducibility: https://drive.google.com/drive/folders/1rLJRrOmR_Y…
-
### 🐛 Describe the bug
When I ran the TensorParallel test `test_mlp_inference` under here [`test/distributed/tensor/parallel/test_tp_examples.py`](https://github.com/pytorch/pytorch/blob/main/test/di…
-
### Your current environment
The output of `python collect_env.py`
(pytorch) [opc@instance-20240805-1058 ~]$ python collect_env.py
Collecting environment information...
PyTorch version: 2.4.0
…
-
Is there a way to save the model in its entirety i.e. weights and architecture via standard means? Currently the weights of the Mask RCNN are saved at every epoch in the logs directory. Now how can I …