-
**Description**
I am trying to use the newly introduced [triton inference server In-Process python API](https://github.com/triton-inference-server…
-
Hello, I tried to use megablocks on a V100 with PyTorch 2.4.0+cu121, but I get the error "cannot support bf16". If I use megablocks in fp32, I get the error "group gemm must use bf16". So I change my enviro…
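A rough sketch of how the hardware side of this could be checked, assuming the bf16 error comes from the GPU's compute capability: NVIDIA GPUs have native bf16 support from compute capability 8.0 (Ampere) onward, while the V100 is 7.0. The helper name `supports_native_bf16` and the `KNOWN_GPUS` table are hypothetical and are not part of megablocks or PyTorch.

```python
# Hypothetical helper for illustration; not part of megablocks or PyTorch.
# Assumption: native bf16 requires compute capability >= 8.0 (Ampere).
AMPERE = (8, 0)


def supports_native_bf16(compute_capability):
    """Return True if a (major, minor) compute capability has native bf16."""
    return tuple(compute_capability) >= AMPERE


# Compute capabilities of a few NVIDIA GPUs, for comparison.
KNOWN_GPUS = {"V100": (7, 0), "A100": (8, 0), "A10": (8, 6)}

for name, capability in KNOWN_GPUS.items():
    print(name, capability, supports_native_bf16(capability))
```

On a machine with a GPU, the tuple can be read with `torch.cuda.get_device_capability()`, and `torch.cuda.is_bf16_supported()` gives PyTorch's own answer.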
-
### System Info
Hello, I am trying to load Mistral-Nemo-Instruct-2407 in bnb 4-bit on 4 A10 GPUs on an EC2 instance.
I upgraded all the packages.
Still, I get a CUDA out-of-memory error when train batc…
-
Hi,
With the Panama [Foreign-Function and Memory API](https://openjdk.org/jeps/454) now mature and official in Java 22, are there plans for JOCL, and of course JCUDA, to utilise it? I've seen it…
-
### 🐛 Describe the bug
image_syn = torch.randn(size=(num_classes*args.dlipc, channel, im_size[0], im_size[1]), dtype=torch.float, requires_grad=True, device=args.dldevice) * 0.01
optimizer_img …
-
### Your current environment
Defaulted container "kserve-container" out of: kserve-container,…
-
Requirement already satisfied: packaging in ./venv/lib/python3.11/site-packages (from -r requirements.txt (line 1)) (21.3)
Collecting torch==2.1.0 (from -r requirements.txt (line 2))
Using cached …
-
Hi,
We ran into a problem when we ran the code with our own adata:
AnnData object with n_obs × n_vars = 28374 × 25086
obs: 'cell_subtype', 'batch', 'n_genes_by_counts', 'total_counts', 'total_counts…
-
### Your current environment
```
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: Ubuntu 22.04.3 LTS (x86_64)
GCC vers…
-
### Describe the Bug
https://github.com/PaddlePaddle/PaddleOCR/issues/13912
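### The whole environment runs in Docker containers on two machines, a master (134) and a worker (131). The containers are started with --ipc=host --network=host --gpus all; the network interface for NCCL communication has been specified on both the master and the worker; passwordless SSH has also been set up in both directions, s…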