cuda-kernels Search Results

1000+ results
for cuda-kernels

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

fixstars/libSGM #26

Indexing of CUDA kernels

Hi, I was trying to understand the implementation of SGM in the code. Although the cuda stuffs are doing their job perfectly, I have the following question : Why the indexing in some kernels prett…

funmonty updated 1 year ago
3
denizyuret/AutoGrad.jl #128

AutoGrad and CUDA kernels

Are there any plans to support custom CUDA kernels? This would be amazing.

renatobellotti updated 2 years ago
1
rapidsai/cucim #771

[BUG] test failures in shared memory variant of 2D filtering…

**Describe the bug** For a Windows 11 system with CuPy 13.3.0, NumPy 1.26.4, CUDA 12.6 and an RTX A3000 GPU, some test failures were seen in some tests of separable filters: **Steps/Code to reprodu…

grlee77 updated 2 weeks ago
1
kotekan/kotekan #946

CUDA Kernels for HIRAX

This is a catch all ticket for the various CUDA related development work for HIRAX: - [ ] Transpose kernels to format data for the N^2 Tensor Core kernel - [X] N^2 Tensor-Core kernels - [ ] Full …

andrerenard updated 1 year ago
1
ralna/spral #219

-lcudart_static and -lcublas not found using meson build sys…

Hello, I've been trying to compile spral for a while now to use it later with Ipopt. First , when compiling with autotools and running make check, the test corresponding to ssids_test fails with a …

aleramos119 updated 2 days ago
10
NVlabs/DiffRL #12

Error when python test_env.py --env AntEnv

Excuse me, I met such problem when I try the command ```python test_env.py --env AntEnv``` in the folder ```examples``` as the guide The version of my Pytorch is 1.11.0, cuda is 12.1 Is there anythi…

HzfFrank updated 1 month ago
6
alpaka-group/alpaka #2371

Document the synchronisation behavious of alpaka buffers

As far as I know, we do not document any synchronisation behaviours for alpaka buffers. However, different buffers and different back-ends effectively implement different behaviours. * A buffer …

fwyzard updated 2 weeks ago
1
NetEase-FuXi/EETQ #30

Unsupported Arch Assertion fail

I am getting this run time error sourced from this file eetq/csrc/cutlass_kernels/cutlass_preprocessors.cc:125. Using TGI text generation launcher with falcon-7b-instruct model. I would like to kn…

rahul3161 updated 1 week ago
1
Drexubery/ViewCrafter #3

Using slow RoPE2D version?

Hi, when trying to run the project locally I encounter the following warning: `Warning, cannot find cuda-compiled version of RoPE2D, using a slow pytorch version instead` - does this impose a si…

atonderski updated 4 days ago
1
mit-han-lab/TinyChatEngine #71

Assistant spitting out non-readable characters on RTX 4060

``` (TinyChatEngine) zhef@zhef:~/TinyChatEngine/llm$ make chat -j CUDA is available! src/Generate.cc src/LLaMATokenizer.cc src/OPTGenerate.cc src/OPTTokenizer.cc src/utils.cc src/nn_modules/Fp32OPT…

zhefciad updated 1 month ago
2

上一页 1...2 3 4 5 6 7 8...100 下一页

1000+ results for cuda-kernels

1000+ results
for cuda-kernels