-
When submitting a bug report, please include the following information (where relevant):
- OS: ubuntu 16.04
- How you installed TC (docker, conda, source): conda
- Python version: 3.6
- CUDA/cuDNN…
-
It would be worth updating the Makefile to import all of the tools required for compilation as it once did and as OP still does. This would make it a little easier for others to contribute to the codi…
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
[2024-07-12 02:22:28,334] [INFO] [real_accelerator.py:203:get_accelerator] Setting ds_accelerator to cuda…
-
I'm a little frustrated that when I play the instrument with my theremin, it snaps to the correct pitch rather than interpreting the pitch of the theremin directly. I can't play microtonal scales or …
-
I am having a severe problem with training AlexNet (see [alexnet.jl](https://gist.github.com/hesseltuinhof/01d52e5ba64546bf6b806b3ffcc10c3f#file-alexnet-jl)) in Julia (0.5.2) on my GPU (12gb mem).
…
-
**Describe the bug**
A clear and concise description of what the bug is.
On executing the below command I am facing segmentation fault and I know that there is the other issue similar to this but …
-
**Issue by [dustinvtran](https://github.com/dustinvtran)**
_Sunday Feb 28, 2016 at 23:09 GMT_
_Originally opened as https://github.com/stan-dev/stan/issues/1780_
----
(There are a number of things …
-
The performance summary shows that my model spend ~50% time in the "kernel launch" step.
I find other items easy to understand, but I have no idea what "kernel launch" is, and how I can reduce its ti…
-
当打开tensorrt转换器开关后,转换逻辑报错。
-
### Motivation
In our experiments and applications, the triton autotuner is key to achieve competitive or best performance (e.g. for [flash attention in vLLM](https://github.com/vllm-project/vllm/i…