cuda-kernels Search Results

1000+ results for cuda-kernels


mit-han-lab/TinyChatEngine #71

Assistant spitting out non-readable characters on RTX 4060

``` (TinyChatEngine) zhef@zhef:~/TinyChatEngine/llm$ make chat -j CUDA is available! src/Generate.cc src/LLaMATokenizer.cc src/OPTGenerate.cc src/OPTTokenizer.cc src/utils.cc src/nn_modules/Fp32OPT…

zhefciad updated 1 month ago
2
arrayfire/arrayfire #3404

Custom CUDA kernel synchronization

Hi, I have a question regarding custom CUDA kernels and synchronization. I tried to proceed as described in [Interoperability with CUDA](https://arrayfire.org/docs/interop_cuda.htm#gsc.tab=0) which st…

FloopCZ updated 2 weeks ago
4
iangellove/Omega-AI #15

Are there any plans to use ND4J

Would it be possible to use the ND4J library instead of CUDA kernels, or could that support be added?

vaiju1981 updated 1 month ago
2
microsoft/onnxruntime #22077

[Feature Request] Native WebGPU Execution Provider

### Describe the feature request Request: Leverage `onnxruntime-web` kernels to create a native WebGPU Execution Provider for **non-web** environments. Story: I am in a unique situation where my…

audioXD updated 10 hours ago
1
huggingface/optimum-quanto #275

Support for FP8 Matmuls

Int8 matrix multiplication kernels are currently called on CUDA and CPU devices when activations and weights are quantized to int8. However, FP8 matmuls are not used when activations and weights are q…

maktukmak updated 4 days ago
1
huggingface/candle #2241

nvcc fatal : Cannot find compiler 'cl.exe' in PATH

I added 'cl.exe' to PATH, but it still says it cannot be found. ``` error: failed to run custom build command for `candle-kernels v0.5.1 (https://github.com/huggingface/candle.git#cd4d941e)` Caused b…

kdletters updated 1 week ago
3
state-spaces/s4 #147

CUDA error: no kernel image is available for execution on th…

Hi there, I have copied s4.py and the kernel extension into another repository I am working on. I had S4 components running (with CUDA), and then I installed the kernel extensions. The build output …

leoauri updated 2 months ago
3
hpcaitech/ColossalAI #6047

[FEATURE]: Is it Possible to integrate Liger-Kernel?

### Describe the feature https://github.com/linkedin/Liger-Kernel Liger Kernel is a collection of Triton kernels designed specifically for LLM training. It can effectively increase multi-GPU train…

ericxsun updated 3 days ago
3
unslothai/unsloth #837

Fast Rotary Position Embedding value different to transforme…

I'm testing unsloth rope and here is my script: ```python import torch from unsloth.kernels.rope_embedding import fast_rope_embedding from unsloth.models.llama import LlamaRotaryEmbedding as Uns…

fahadh4ilyas updated 1 month ago
3
ROCm/hipfort #54

Cuda Fortran kernels support

Will this tool only support hipify of host Fortran calls, or will it also be able to compile CUDA Fortran kernels? For example: ``` attributes(global) subroutine saxpy(x, y, a) implicit none …

maiconfaria updated 1 year ago
15
