-
### Your current environment
```text
The output of `python collect_env.py`
```
Collecting environment information...
PyTorch version: 2.1.2+cu121
Is debug build: False
CUDA used to build PyTorc…
-
**Describe the bug**
fp8 e4m3 wgrad seems to be extremely slow compared to both FP32 and FP16, often 50x to 100x slower.
I have attached the profiling results in [this Google spreadsheet](https://doc…
-
### Describe the issue
The latest version (1.17.3) is missing from the artifacts feed:
https://aiinfra.visualstudio.com/PublicPackages/_artifacts/feed/onnxruntime-cuda-12
### To reproduce
https://aiinfra.…
-
### Your current environment
```text
Collecting environment information...
PyTorch version: N/A
Is debug build: N/A
CUDA used to build PyTorch: N/A
ROCM used to build PyTorch: N/A
OS: Amazo…
-
The build keeps failing with errors, and nothing I have tried has fixed it. The error output is below; could you please take a look?
cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
In file included from /home/wgd@corp.sse.tongji.edu.cn/Bridgin…
-
### What is the issue?
After I upgraded the image to 0.4.0, a model that previously worked started failing with this error. The full log is as follows:
```
2024/11/14 11:29:13 routes.go:1189: INFO server co…
-
**Describe the bug**
As the title says, I get an error when I call the ```sin``` function after updating CUDA.jl. It worked fine in the previous version.
I tried some cases and it seems that this error p…
-
### Describe the issue
I am aware that I can create and register an allocator with the active environment so that my session does not create its own allocator, but rather uses the already attached all…
-
While running GPGPU-Sim, I am getting a segmentation fault. Looking at the log file, the PTX CUDA API appears to be stuck in a loop.
![image](https://github.com/user-attachments/assets/e6a243bd-82fa-4c8b-9dff-d…
-
### Issue summary
Makefile line 593 problem: `lstm_unit_layer.o` failed
### Steps to reproduce
I followed this guide up to the `make all` step:
https://github.com/BVLC/caffe/wiki/Ubuntu-16.04-Installation-Guide
…