simt Search Results - Githubissues

intel/graph-compiler #260

XeGPU simt lowering

- [ ] Distribution patterns for XeGPU ops - [ ] Populate XeVM with basic ops for efficient matmul - [ ] Sg_map propagation analysis/pass - [ ] Distributed IR flattening

kurapov-peter updated 2 weeks ago

iree-org/iree #18905

'func.func' op uses 873872 bytes of shared memory; exceeded …

For [this ](https://gist.github.com/nirvedhmeshram/344f11443b96fb9ff022fa283cc6cd8a) matmul like + elementwise IR, we go down the LLVMGPUSIMT pipeline, see dump [here](https://gist.github.com/nirvedhm…

nirvedhmeshram updated 1 week ago

NVIDIA/cutlass #1599

[QST] can group conv support simt version ?

hi, i see that specialization defs of template class `DefaultConv2dGroupFprop` in file `cutlass/conv/kernel/default_conv2d_group_fprop.h` has no OpClassSimt tag, what can i do to support simt version …

tengdecheng updated 3 weeks ago

ROCm/ROCgdb #10

Which version of LLVm is required by ROCgdb to support "DWAR…

Which version of LLVm is required by ROCgdb to support "DWARF Extensions for Optimized SIMT/SIMD (GPU) Debugging"?

Chunming-Zhou updated 2 days ago

THU-DSP-LAB/ventus-gpgpu #1

SIMT-deadlock

is there SIMT-deadlock issue for the SIMT-stack based divergence? how to deal with it if yes?

rill-zhen updated 2 years ago

NVIDIA/cutlass #1800

[QST] kInternalError while increasing warp count in older S…

**What is your question?** Internal CUTLASS error is observed, when I try increasing the warp count for kernel "cutlass_simt_hgemm_256x128_8x2_nt_align1" to values other than default 4x2x1 (by changi…

Shreya-gaur updated 1 month ago

intel/intel-xpu-backend-for-triton #991

[DPAS] Support low precision DPAS on A770 with sub-group-siz…

The Triton XPU has switched to use the OCL interface for DPAS. The OCL interface only supports the sub-group-size=8 with the packed i16 Dtype for A operands. It requires a different layout in the SI…

chengjunlu updated 3 months ago

intel/mlir-extensions #658

[Triton] To inline the VC intrinsic in the SIMT kernel.

## Background The Triton kernel is generated as SIMT major SPIRV kernel. It is because some component has to be used with SIMT paradigm. Like: Intel math library is only SIMT version. But for some m…

chengjunlu updated 1 year ago

traveller59/spconv #706

Data exceed int32 range

```bash RuntimeError: /io/build/temp.linux-x86_64-cpython-37/spconv/build/core_cc/src/cumm/conv/main/ConvMainUnitTest/ConvMainUnitTest_matmul_split_Simt_f32f32f32_0.cu(222) int64_t(N) * int64_t(C) *…

lyhsieh updated 1 month ago

NVIDIA/cutlass #1671

[BUG] cutlass_profiler make error

**Describe the bug** win camek , but make error. -- Enable device reference verification in conv unit tests -- Generating D:/github/cutlass-main/build/test/unit/conv/device/cutlass_test_unit_conv…

kn1ghtc updated 2 months ago

495 results for simt

495 results
for simt