-
### 🐛 Describe the bug
name | batch_size_new | speed_up_new | inductor_new | eager_new | compilation_latency_new | batch_size_old | speed_up_old…
-
Hello @glenn-jocher, I'm sorry to keep asking questions.
I would sincerely appreciate your answer.
- GPU test in JupyterLab
1. Device check
import torch
print(torch.cuda.get_device_name(0))
…
-
### System Info
pip list |grep -i -E 'cuda|torch'
nvidia-cuda-cupti-cu12 12.1.105
nvidia-cuda-nvrtc-cu12 12.1.105
nvidia-cuda-runtime-cu12 12.1.105
torch 2.3.0
GPU i…
-
### Bug description
When using torchdata `DataPipes` and `DataLoader2`, the iterator is not closed or shut down. As a result, the parent process never exits if a parallel reader is used.
A MWE that …
-
### 🐛 Describe the bug
The `torch.cuda.device_count` function uses an LRU cache of size 1, but because it takes no arguments, underlying state changes in environment variables can cause this func…
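
To illustrate the general caching pattern described here, below is a minimal sketch that does not use PyTorch at all. The function `visible_device_count` and the environment variable `DEMO_VISIBLE_DEVICES` are hypothetical stand-ins made up for this example; the only real API used is `functools.lru_cache`. It shows how a zero-argument cached function keeps returning its first result after the environment changes, until the cache is cleared.

```python
import os
from functools import lru_cache


@lru_cache(maxsize=1)
def visible_device_count() -> int:
    """Hypothetical stand-in for a zero-argument query that depends on the environment."""
    raw = os.environ.get("DEMO_VISIBLE_DEVICES", "")
    # Count the non-empty, comma-separated device ids.
    return len([d for d in raw.split(",") if d.strip()])


# First call: the value is computed from the environment and cached.
os.environ["DEMO_VISIBLE_DEVICES"] = "0,1"
first = visible_device_count()

# The environment changes, but the call has no arguments, so the cache key
# is identical and the cached value is returned without recomputation.
os.environ["DEMO_VISIBLE_DEVICES"] = "0"
stale = visible_device_count()

# Clearing the cache forces the next call to read the environment again.
visible_device_count.cache_clear()
fresh = visible_device_count()

print(first, stale, fresh)
```

In this sketch `stale` still reflects the old environment, and only the call after `cache_clear()` picks up the new value. Whether a similar cache reset is appropriate for the real function is not something this example establishes.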
-
Hi,
I'm trying to run distributed training of llama-7b on a VM with two Tesla T4 GPUs, using Ray with DeepSpeed as the strategy. I'm facing the following error: "Could not pickle object as excessively…
-
### 🐛 Describe the bug
AMP models static/dynamic default/CPP wrapper accuracy/performance crash
suite | name | thread | accuracy | perf | reason(reference o…
-
### Your current environment
```text
Collecting environment information...
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
…
-
```
root@localhost:/workspace/xtuner# pip install -e '.[all]'
root@localhost:/workspace/xtuner# xtuner
[2023-09-11 15:13:59,638] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelera…
-
### 🐛 Describe the bug
Here’s a brief overview of my process:
1. I generated a dataset using PyTorch by applying the SAM mask from bounding boxes to my images.
2. After creating the dataset, I spli…