hip-kernel-language Search Results

417 results for hip-kernel-language

llvm/llvm-project #65826

[OpenMP] Stack frame size exceeds limit on AMD GPU targets

Compiling Grid with OpenMP target offload to AMD GPUs throws errors: ``` error: stack frame size (149840) exceeds limit (131056) in function '__omp_offloading_72_1e118ab9__ZN4Grid7LatticeINS_7iSc…

atif4461 updated 3 months ago
11
vllm-project/vllm #4313

[Installation]: Compile and Install from source

### Your current environment ```text PyTorch version: 2.2.1+cu121 Is debug build: False CUDA used to build PyTorch: 12.1 ROCM used to build PyTorch: N/A OS: Ubuntu 20.04.3 LTS (x86_64) GCC ve…

chenchunhui97 updated 1 month ago
11
lamikr/rocm_sdk_builder #12

Fedora 40: Compile failure after GCC14 patch

Initial compile attempt failed with this issue here: https://github.com/ROCm/rocm_smi_lib/issues/170 After applying the following patch, this fixed the initial issue above: ``` --- a/include/r…

Crizle updated 1 month ago
39
are-we-gfx1100-yet/triton #1

Complete support for RX 7000 series

The upstream repo is now supporting RX 7000 series, but there are failed tests: ``` FAILED python/test/unit/language/test_core_amd.py::test_reduce1d[min-int16-128] FAILED python/test/unit/languag…

evshiron updated 1 year ago
1
openmm/openmm #3937

Investigate slow OpenCL performance on AMD

This is to continue the discussion that started in #3934. On AMD GPUs, the OpenCL platform is sometimes several times slower than the HIP platform. We're trying to figure out why. Much of the slown…

peastman updated 8 months ago
105
apache/mxnet #621

Support for other Device Types, OpenCL AMD GPU

It would be nice to eventually have OpenCL support for those of us with GPUs that don't do CUDA.

philtomson updated 3 years ago
43
ROCm/HIP #2524

`clock` function is redeclared with the wrong return type

/opt/rocm/include/hip/amd_detail/amd_device_functions.h:995 declares `clock()` to have return type `long long`. ctime.h declares it to have return type `clock_t`, which is `long`. Please fix.

seanbaxter updated 1 week ago
2
pytorch/pytorch #121367

`torch.compile` makes triton kernel slower in some cases

### 🐛 Describe the bug Hello, this is a follow-up issue of the previous https://github.com/pytorch/pytorch/issues/120478. The original issue was fixed in PR https://github.com/pytorch/pytorch/pull/12…

HeyangQin updated 4 months ago
3
ROCm/clang-ocl #4

Compile binary on Linux and load it on windows?

I am trying to use this script to compile a kernel on linux and load the binary in my windows program with clCreateProgramWithBinary. Will this work? I already get the ROCm compiler, and this clang…

behindthepixels updated 6 years ago
10
triton-lang/triton #4178

Hitting an assertion in `RemoveLayoutConversions` Pass. Rele…

Hi all, I am working on a kernel which hits an assertion in `RemoveLayoutConversions` pass during the IR rewrite (the latest `main` branch). The bug is common for both `cuda` and `hip` backends. …

ravil-mobile updated 2 weeks ago
2
