-
# Summary
Introduce a Level-Zero Core or Tools API that enables setting up an additional timestamp-enabled event (on top of the one already set) for a GPU task being submitted into the command list.
# Details …
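For context, here is a minimal sketch of how a single timestamp-capable signal event is attached to a kernel submission today; all handles (`ctx`, `dev`, `cmdList`, `kernel`, `groupCount`) are assumed to have been created elsewhere:

```cpp
#include <level_zero/ze_api.h>

// Sketch: attach ONE timestamp-capable signal event to a kernel append.
ze_event_handle_t appendWithTimestamp(ze_context_handle_t ctx,
                                      ze_device_handle_t dev,
                                      ze_command_list_handle_t cmdList,
                                      ze_kernel_handle_t kernel,
                                      ze_group_count_t groupCount) {
    ze_event_pool_desc_t poolDesc = {};
    poolDesc.stype = ZE_STRUCTURE_TYPE_EVENT_POOL_DESC;
    poolDesc.flags = ZE_EVENT_POOL_FLAG_KERNEL_TIMESTAMP; // timestamp events
    poolDesc.count = 1;
    ze_event_pool_handle_t pool = nullptr;
    zeEventPoolCreate(ctx, &poolDesc, 1, &dev, &pool);

    ze_event_desc_t eventDesc = {};
    eventDesc.stype = ZE_STRUCTURE_TYPE_EVENT_DESC;
    eventDesc.index = 0;
    eventDesc.signal = ZE_EVENT_SCOPE_FLAG_HOST;
    ze_event_handle_t tsEvent = nullptr;
    zeEventCreate(pool, &eventDesc, &tsEvent);

    // Only this single signal-event slot exists per append today; the
    // proposal above is to allow an additional timestamp event alongside it.
    zeCommandListAppendLaunchKernel(cmdList, kernel, &groupCount,
                                    tsEvent, 0, nullptr);
    return tsEvent;
}

// After the command list is closed, submitted, and the event has signaled,
// the timestamps can be read back:
//   ze_kernel_timestamp_result_t ts = {};
//   zeEventQueryKernelTimestamp(tsEvent, &ts);
```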
-
I have some code that launches multiple kernels and distributes them across multiple queues belonging to different CUDA devices. When only one GPU is used, we get the following dependency graph:
![dep_gra…
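For reference, a minimal sketch of the launch pattern described above, assuming one stream ("queue") per device and `cudaMemsetAsync` standing in for the real kernel launches (buffer size and task count are placeholders, not the actual code from this report):

```cpp
#include <cuda_runtime.h>
#include <vector>

int main() {
    int numDevices = 0;
    cudaGetDeviceCount(&numDevices);

    // One stream per device.
    std::vector<cudaStream_t> streams(numDevices);
    std::vector<float*> buffers(numDevices);
    for (int d = 0; d < numDevices; ++d) {
        cudaSetDevice(d);
        cudaStreamCreate(&streams[d]);
        cudaMalloc(&buffers[d], 1 << 20);
    }

    // Distribute the work round-robin across the devices' streams;
    // cudaMemsetAsync is a placeholder for the real kernel launches.
    const int numTasks = 8;
    for (int t = 0; t < numTasks; ++t) {
        int d = t % numDevices;
        cudaSetDevice(d);
        cudaMemsetAsync(buffers[d], 0, 1 << 20, streams[d]);
    }

    // Wait for every device to drain its stream.
    for (int d = 0; d < numDevices; ++d) {
        cudaSetDevice(d);
        cudaStreamSynchronize(streams[d]);
    }
    return 0;
}
```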
-
### 🚀 The feature, motivation and pitch
The official implementation of FlashAttention is in CUDA, so on AMD GPUs, users cannot easily use FlashAttention with transformers to train LLMs. With the …
-
**Describe the proposal**
We should automatically add `-mllvm -amdgpu-function-calls=true` to the compiler flags when `-DAMREX_GPU_BACKEND=HIP` is set (AMD GPUs). This works around compiler bugs for large G…
-
Tested with driver: 14616
$ rbuild package -d deps -DGPU_TARGETS=gfx1201
...
$ make check -j$(nproc)
[ 1%] Built target embed_lib_migraphx_kernels
349/380 Test #378: test_py_3.…
-
### Problem Description
Looks like another problem with `amdgpu-dkms` and new kernels. I was running ROCm 6.2 just fine with kernel 6.8.0-41. Ubuntu just pushed kernel 6.8.0-44; however, `amdgpu-dkm…
-
Hello! Having studied the documentation provided, I still could not tell whether GGUF-quantized models are supported on AMD GPUs. I would like to use the Q8 or even Q4 model based on Mistr…
-
### System Info
### Full log:
Installing text-embeddings-router v1.5.0 (/data_train/search/InternData/jiejuntan/python/text-embeddings-inference/router)
Updating git repository `https://githu…
-
Do we have symbol debugging for GPU kernels now?
I am using the ROCgdb shipped with rocm-4.5.2. When checking local variables or args in rocgdb, it always shows "Optimized".
Wondering if it was optimized …
-
Adjust the GPU class so that multiple kernels can be loaded and the arguments for the multiple kernels are also set correctly.
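No code is attached, so here is a minimal sketch of one way this could look, assuming an OpenCL-based `Gpu` class; the class name, the choice of OpenCL, and all member names are assumptions, not the project's actual code:

```cpp
#define CL_TARGET_OPENCL_VERSION 120
#include <CL/cl.h>
#include <map>
#include <string>

// Hypothetical Gpu class extended to hold several kernels at once,
// keyed by name, so that arguments can be set per kernel.
class Gpu {
public:
    explicit Gpu(cl_program program) : program_(program) {}

    ~Gpu() {
        for (auto& [name, k] : kernels_) clReleaseKernel(k);
    }

    // Load (or reuse) a kernel by name from the compiled program.
    cl_kernel kernel(const std::string& name) {
        auto it = kernels_.find(name);
        if (it != kernels_.end()) return it->second;
        cl_int err = CL_SUCCESS;
        cl_kernel k = clCreateKernel(program_, name.c_str(), &err);
        if (err == CL_SUCCESS) kernels_.emplace(name, k);
        return k;
    }

    // Set one argument on the named kernel.
    cl_int setArg(const std::string& name, cl_uint index,
                  size_t size, const void* value) {
        return clSetKernelArg(kernel(name), index, size, value);
    }

private:
    cl_program program_;
    std::map<std::string, cl_kernel> kernels_;
};
```

Keying the kernels by name keeps argument setup per kernel independent, which is the failure mode described above when only a single kernel handle is stored.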