-
Hi,
I was trying to understand the implementation of SGM in the code. Although the cuda stuffs are doing their job perfectly, I have the following question :
Why the indexing in some kernels prett…
-
Are there any plans to support custom CUDA kernels? This would be amazing.
-
**Describe the bug**
For a Windows 11 system with CuPy 13.3.0, NumPy 1.26.4, CUDA 12.6 and an RTX A3000 GPU, some test failures were seen in some tests of separable filters:
**Steps/Code to reprodu…
-
This is a catch all ticket for the various CUDA related development work for HIRAX:
- [ ] Transpose kernels to format data for the N^2 Tensor Core kernel
- [X] N^2 Tensor-Core kernels
- [ ] Full …
-
Hello,
I've been trying to compile spral for a while now to use it later with Ipopt. First , when compiling with autotools and running make check, the test corresponding to ssids_test fails with a …
-
Excuse me, I met such problem when I try the command ```python test_env.py --env AntEnv``` in the folder ```examples``` as the guide
The version of my Pytorch is 1.11.0, cuda is 12.1
Is there anythi…
-
As far as I know, we do not document any synchronisation behaviours for alpaka buffers.
However, different buffers and different back-ends effectively implement different behaviours.
* A buffer …
-
I am getting this run time error sourced from this file eetq/csrc/cutlass_kernels/cutlass_preprocessors.cc:125.
Using TGI text generation launcher with falcon-7b-instruct model.
I would like to kn…
-
Hi, when trying to run the project locally I encounter the following warning:
`Warning, cannot find cuda-compiled version of RoPE2D, using a slow pytorch version instead`
- does this impose a si…
-
```
(TinyChatEngine) zhef@zhef:~/TinyChatEngine/llm$ make chat -j
CUDA is available!
src/Generate.cc src/LLaMATokenizer.cc src/OPTGenerate.cc src/OPTTokenizer.cc src/utils.cc src/nn_modules/Fp32OPT…