-
I am trying to compile the following CUB test program -
```
#include
int main()
{
uint32_t* d_samples;
uint8_t* d_histogram;
uint8_t* d_levels;
size_t temp…
-
Hi @66RING, thank you for your helpful work.
I have one question about the use of kBlockKSmem in csrc/kernel_traits.h. When you define SmemLayoutAtomQ:
```
using SmemLayoutAtomQ = decltype(
…
-
Hi, first Thank You for a fantastic new release **0.7.2** with newly added **PSS** memory metrics, which is really cool and useful !
**Problem:**
We upgraded process-exporter on all of our machine…
itsx updated
2 months ago
-
Recent versions of Petitboot are setting the physical address of the framebuffer, on the device tree, to a wrong value (assigned-addresses property of vga node).
This was seem on a Talos II machine, …
-
The disassembler thinks that a SMEM instruction that uses s106 and/or 107 is invalid, and same with VMEM instructions. But in fact these should work.
-
### Problem Description
I am investigating usage of instruction v_mfma_f32_16x16x16_f16 and nvidia equivalent warp-level mma (swizzle SRAM memory + ldmatrix registers + mma over registers, for Ampere…
-
**Describe the bug**
Using DefaultCopy on A100 implicitly generates the unexpected LDGSTS. Users are not aware of the need to commit and wait.
**Steps/Code to reproduce bug**
```
using GmemTile…
cctry updated
8 months ago
-
**Describe the bug**
silhouette_score will lead to CUDA error when running check_labels on larger array sizes.
**Steps/Code to reproduce bug**
```
import numpy as np
from cuml.metrics.cluster i…
goraj updated
4 months ago
-
Here's the message:
```
C:\Users\pwu\Desktop\later\cmake-build-debug\test\test_qr.exe 2 8192 8192 -check
=== Device information ===
Device name: GeForce RTX 2060
Compute Capability: 7.5
OS: W…
-
其中4.1.0上电即异常,打印输出如下:
[09:48:22.840]收←◆
\ | /
- RT - Thread Operating System
/ | \ 4.1.0 build Aug 31 2023 09:48:05
2006 - 2022 Copyright by RT-Thread team
(rt_object_get_type(&m->pa…