-
### 🐛 Describe the bug
!!! Exception during processing!!! free_upper_bound + pytorch_used_bytes[device]
-
### 🐛 Describe the bug
The process works correctly with DDP world size 1, but with world size > 1 it hangs with GPU 0 at 0% and GPU 1 pinned at max occupancy. I've replicated this bot…
-
I was trying to run the DLRMv2 benchmark of MLPerf Inference on an ARM server using the instructions [here](https://docs.mlcommons.org/inference/benchmarks/recommendation/dlrm-v2/#__tabbed_15_1).
…
-
### 🐛 Describe the bug
`torch.nansum` does not work with complex tensors containing `nan` values on CPU (works on GPU) ([colab](https://colab.research.google.com/drive/1b_3zgqEQqdFjKOW-TisFy_ED47Xo52…
-
Platforms: asan, linux, mac, macos, win, windows
This test was disabled because it is failing in CI. See [recent examples](https://hud.pytorch.org/flakytest?name=test_device_mode_ops_sparse_mm_reduce…
-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and found no similar bug report.
### Ultralytics YOLO Component
Trai…
-
My training starts, but it does not show any progress or speed. I thought it would show the speed for every step and the estimated finish time.
How can I see the training speed in it/s? Or do I have to wait for one epoch to end?
gp…
-
It trains fine for a while, and then often I get a CPU OOM, which looks like:
```
[2024-01-04 11:41:05,662] INFO: Start Job: Job Task: run
...
RETURNN starting up, version 1.20240104.103023+git.a0…
-
## Prebuilt wheels for PyTorch packages with custom ops
I've created a repository that can build PyTorch wheels with custom ops through the GitHub Actions pipeline and publish them using GitHub Rel…
-
### 🚀 The feature, motivation and pitch
Extend the SoC list to
- [ ] https://www.devicespecifications.com/en/model-cpu/36b552e1
- [ ] samsung note 10
- [ ] samsung note 20
### Alternatives…