-
**What is your question?**
I've read the example 59, it seems there is a easy and elegant way to assemble a conv kernel by using Cute, but the conv params are assumed to be known at complie time, and…
-
-
Box2C has refactored it's API to be simpler using structs and handles. The collision routines are easily accessible and callable without any setup or fuss. We should remove cute_c2.h and replace it wi…
-
리핀is cute
why?
-
Hi:
I'd like to test FP8 in RTX 4090. I can find some BF16 functions like SM80_16x8x8_F32BF16BF16F32_TN in cutlass/include/cute/arch/mma_sm80.hpp, however, I can't find some FP8 functions like SM80_1…
-
![image](https://user-images.githubusercontent.com/1121921/65662396-3d07f980-e002-11e9-91c9-6e75d95b7526.png)
-
I run the example in the quick start guide.
My GPU is A30, the command is `nvcc 01_gemm_3.0.cu -arch=sm_80`
It complains errors:
```
01_gemm_3.0.cu(51): error: too few arguments for class templ…
-
Whenever I try to add it to cute chess it says cannot run engine and doesn't add it why does this error comes .. I use python main.py in the command dialogue box and the directory of bot files in work…
-
**What is your question?**
```
auto sA_layout = make_layout(make_shape(bM, bK)); // (m,k) -> smem_idx; m-major
ThrCopy thr_copy_a = copy_a.get_slice(threadIdx.x);
Tensor sA = make_t…
-
Im cute