gpu-kernels Search Results

1000+ results
for gpu-kernels

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ROCm/rocprofiler #60

rocm profiler creates trace for 1 gpu only when kernels laun…

``` #include #include "hip/hip_runtime.h" // 1. if N is set to up to 1024, then sum is OK. // 2. Set N past the 1024 which is past No. of threads per blocks, and then all iterations of sum resu…

gggh000 updated 1 month ago
4
ROCm/rocprofiler #128

Do I need --parallel-kernels option for multi-kernel on mult…

Hello, I am using concurrent kernel execution on multi-GPU system using multi-stream (see code example below). Example: ``` for(i = 0; i < GPU_N; i++) { .... //Set device hipSetDevice(i)…

mabdallah89 updated 2 weeks ago
1
kokkos/kokkos-kernels #2316

kokkos kernels: broken unit test w/ cuda 12.4 on h100 gpus w…

Hi, I've been testing trilinos and came across a broken kk unit tests on h100s w/ cuda 12.4. I have not tried to reproduce the broken test stand alone but figured I'd report it. See configuration 1…

vasylivy updated 3 weeks ago
11
torch-points3d/torch-points-kernels #103

Failed to install torch-points-kernels

During installation of torch-points-kernels using this command pip install torch-points-kernels==0.7.0, I will faced the error for build wheels Error :- --> Building wheel for torch-points-kerne…

VikasSargar777 updated 2 days ago
4
huggingface/transformers #32861

Integrate Liger (Linkedin GPU Efficient Runtime) Kernel to H…

### Feature request Integrate Liger (Linkedin GPU Efficient Runtime) Kernel to HuggingFace Trainer, user could decide whether to enable kernel with a simple flag ### Motivation Liger (Linkedi…

JasonZhu1313 updated 4 hours ago
4
Rtoax/VTI-FD-CUDA-GTK #1

Undefined Symbol reference errors during the build process

在编译和链接过程中遇到未定义符号的错误 ``` # Copyright (C) RongTao, All right reserve. CUDA_INSTALL_PATH = /usr/local/cuda-12.1 GCC_INSTALL_PATH = /usr NVCC = $(CUDA_INSTALL_PATH)/bin/nvcc #cuda_12.1.r12.1 GCC =…

SIKtt updated 2 months ago
1
slimgroup/Devito4PyTorch #2

Expose GPU kernels for Pytorch

This is an interesting application of Devito. Was wondering if you know of a way to expose GPU computing in devito. I see all the examples go pytorch GPU -> numpy cpu -> Devito -> numpy cpu -> pytorch…

loliverhennigh updated 7 months ago
1
PaddlePaddle/PaddleSeg #3587

使用自己的数据集训练U2NET，训练到中途，抛出错误， 'cudaErrorLaunchFailure'. 求教大佬！！

### 问题确认 Search before asking - [X] 我已经搜索过问题，但是没有找到解答。I have searched the question and found no related answer. ### 请提出你的问题 Please ask your question 使用自己标注的数据集训练，使用U2NET模型训练，训练 iter==100时，c…

fxx-chaos updated 3 weeks ago
3
microsoft/onnxruntime #22077

[Feature Request] Native WebGPU Execution Provider

### Describe the feature request Request: Leverage `onnxruntime-web` kernels to create a native WebGPU Execution Provider for **non-web** environments. Story: I am in a unique situation where my…

audioXD updated 1 day ago
2
pytorch/pytorch #135095

Add MSCCL++ as a communication backend for PyTorch

### 🚀 The feature, motivation and pitch MSCCL++ redefines inter-GPU communication interfaces, offering a highly efficient and customizable communication stack tailored for distributed GPU application…

lerrorgk updated 1 day ago
5

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for gpu-kernels

1000+ results
for gpu-kernels