simt Search Results - Githubissues

495 results
for simt

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

intel/intel-xpu-backend-for-triton #171

GEMM Block-pointer Path

get triton gemm perf 80% of oneDNN/XeTLA utilizing genISA/vc-intrinsics. the lowering pipeline would be "triton -> tritongpu -> optimized/simplified tritongpu => llvm/spirv". this serves as an um…

Dewei-Wang-sh updated 6 months ago
14
openmm/openmm #2489

Future of osx GPU support

Now that Apple [deprecated OpenCL support in `osx` 10.12]() and [NVIDIA will no longer provide CUDA support for `osx` after CUDA 10.2](https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.htm…

jchodera updated 1 week ago
206
quassnoi/explain-extended-2024 #1

Thoughts on GPU vs CPU

I'm assuming this can't be optimised to run on GPU (unless Postgres has some extension) How long does it take comparatively. Is it 100x slower?

wweevv-johndpope updated 8 months ago
2
OSU-STARLAB/Simul-LLM #4

Request for Sharing the Adapter Model of NMT LLM (Falcon or …

Hello, I want to express my appreciation for your impressive work on SiMT community. I am interested in obtaining the adapter model for NMT LLM (Falcon or Llama2) and would like to kindly request …

zhongmz updated 10 months ago
2
intel/intel-xpu-backend-for-triton #400

[DPAS] The DPAS operation results are not correct for the D …

In the cases to lower the `tt.dot` to dpas with the fp16 D type, the results of the DPAS is not correct. The DPAS op in the MLIR with GenX dialect: `%23884 = genx.matrix.dpas %23613, %23165, %2336…

chengjunlu updated 5 months ago
8
NVIDIA/cutlass #1474

[QST] PyTorch + CUTLASS Batched GEMM Kernel - Expression Mus…

Good evening, all. I am attempting to compile a minimal [CUTLASS](https://github.com/NVIDIA/cutlass/releases/tag/v3.4.1) GEMM example in a PyTorch project. I want to write a simple CUTLASS kernel and …

nickjeliopoulos updated 7 months ago
5
chjackson/flexsurv #64

Plotting Kaplan Meier curves for models with interval censor…

The following gives a warning "Invalid status value, converted to NA" ``` simt

chjackson updated 11 months ago
1
NVIDIA/cutlass #1323

[BUG] Stride is ignored for dst tensor of a Conv2dFprop

I have implemented a basic sample code to convolve a 2D image with a row filter. It works, but when the dst image has some stride, it seems ignored by CUTLASS and all the extra elements of the first …

chacha21 updated 19 hours ago
41
THU-DSP-LAB/ventus-gpgpu-isa-simulator #13

[bug] instruction join jumps to wrong rpc

在执行嵌套分支的时候，内层分支在汇合时，join指令会在rpc和当前pc不一致的情况下导致simt_stack出栈。 ![80ac4863b87949f874255b6846c182e](https://github.com/THU-DSP-LAB/ventus-gpgpu-isa-simulator/assets/37099022/f0821c77-ba88-4bdb-930e-4b3…

yangzexia updated 11 months ago
2
accel-sim/accel-sim-framework #267

error when compile gpu-simulator

i encounter errors when build accel-sim simulator following readme, and no previous issue related to this problem i ran ```bash pip3 install -r requirements.txt source ./gpu-simulator/setup_enviro…

Charles-Tang updated 4 months ago
9

上一页 1...17 18 19 20 21 22 23...50 下一页

495 results for simt

495 results
for simt