wmma-api Search Results

69 results
for wmma-api

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

JuliaGPU/CUDA.jl #1700

WMMA test failure

Latest GPUCompiler enables IR verification on CI, and it immediately spotted the following bug: ``` device/intrinsics/wmma: Error During Test at /var/lib/buildkite-agent/builds/gpuci-8/julialang/c…

maleadt updated 1 year ago
2
AyakaGEMM/Hands-on-GEMM #5

Intersting project

Seems like you wanna to do an implementation of int8 gemm to do NLP inference? I wonder why you choose to hands on wmma api to do int8 gemm instead of opensource library like cutlass or vendor library…

LeiWang1999 updated 1 year ago
4
JuliaGPU/CUDA.jl #1718

Error during CUDA test

**Describe the bug** I encountered an error while testing CUDA. This is my first time trying to use CUDA. The culprit appears to be cudadrv (drivers?). I already tried updating my Nvidia drivers. …

oneg1101 updated 1 year ago
3
pytorch/torchdynamo #107

Get TorchBench result on Dynamo inference TRT

The CI is ready, working on understanding the results - quite different from what we get from `torchbench.py`

xuzhao9 updated 1 year ago
36
JuliaGPU/CUDA.jl #1431

WMMA kernel works with Julia 1.7.2 but fails with `illegal m…

Using the current CUDA.jl master (`146ad00c0`) the following kernel works with Julia 1.7.2 but fails with Julia 1.8.0-beta1. ```julia function kernel_wmma_int8_lowlevel(a_dev, b_dev, c_dev, d_dev)…

carstenbauer updated 2 years ago
2
PaddlePaddle/Paddle #48321

使用自动混合精度训练，速度不增反降

### 请提出你的问题 Please ask your question - PaddlePaddle 2.4.0rc0 - Ubuntu 18.04 参考的文档：https://www.paddlepaddle.org.cn/documentation/docs/zh/guides/performance_improving/amp_cn.html 使用的模型包括deepsp…

yeyupiaoling updated 1 year ago
12
yairm210/Unciv #8627

1 Air sweeps anywhere causes the game to crash

**Platform** Android **Version** 4.4.11 ver 808 from Google play store. **Describe the bug** Air sweep = crash page **To Reproduce** Basically choose air sweep and it'll crash.

kazewolf1 updated 1 year ago
7
apache/tvm #12102

TVM v0.9.0.rc0 Release Candidate Notes

**Please leave any comments or edit this issue directly to adjust the release notes! Also see the rc0 vote thread in #12103**. # Introduction The TVM community has worked since the v0.8 release …

driazati updated 2 years ago
0
NVIDIA/cutlass #293

How to use CUTLASS GEMM for small matrix

Hi, I would like to execute a CUTLASS GEMM (A*B+C) that uses the Tensor Cores on my Volta architecture with : - matrix A size = 6x123 - matrix B size = 64x6 - matrix C size = 64x123 So, it …

ju9379 updated 2 years ago
21
tlc-pack/tvm-tensorir #439

[BUG][MetaSchedule] Cuda Tensorcore Integration Test Error

While running `test_integration_cuda_tensorcore.py`, I got the following error. ``` @tvm.script.tir class Module: def main(var_A: ty.handle, var_B: ty.handle, var_C: ty.handle) -> None: …

zxybazh updated 3 years ago
1

上一页 1...1 2 3 4 5 6 7...7 下一页

69 results for wmma-api

69 results
for wmma-api