-
### Problem Description
I am investigating the usage of the instruction v_mfma_f32_16x16x16_f16 and the NVIDIA equivalent, warp-level mma (swizzle SRAM memory + ldmatrix registers + mma over registers, for Ampere…
-
Is there a way to get the output stream?
I'd like to buffer some video for manual playback. Currently I use `--sout #std{mux=ts,access=file,dst=-}` to output the data to the console and capture it m…
-
@tridao
Hello!
I am currently working with the selective scan forward kernel, specifically the step h_t = A*h_{t-1} + Bx, where h_0 is currently set to 0. I would like to modify this behavior to …
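
To make the question concrete, here is a minimal sequential sketch of the recurrence with a configurable initial state. It is a scalar CPU reference only, not the parallel scan used in the CUDA kernel. The function name `scan_with_initial_state` and its parameters are hypothetical, and it assumes the per-step coefficient `a[t]` and the already-multiplied input term `bx[t]` are given as plain arrays.

```c
#include <stddef.h>

/* Sequential reference for h_t = a[t] * h_{t-1} + bx[t].
 * h0 is the initial state; the current behaviour corresponds to h0 = 0.
 * Writes h_1..h_n into out[0..n-1] and returns the final state h_n.
 * Hypothetical helper for illustration, not part of the kernel. */
float scan_with_initial_state(const float *a, const float *bx, size_t n,
                              float h0, float *out)
{
    float h = h0;                 /* start from the supplied state */
    for (size_t t = 0; t < n; ++t) {
        h = a[t] * h + bx[t];     /* one step of the recurrence */
        out[t] = h;
    }
    return h;
}
```

With this formulation, passing the returned final state of one chunk as `h0` for the next chunk would continue the sequence across chunk boundaries.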
-
In your matmul-pingpong-v1.cu, the code:
`/// make shared memory descriptor
template
DEVICE GmmaDescriptor make_smem_desc(PointerType smem_ptr) {
GmmaDescriptor desc;
uint32_t uint_ptr = stat…
-
In many CUDA-related projects, we can see a line of code like the following:
```
extern "C" __device__ uint32_t __nvvm_get_smem_pointer(void *ptr);
```
It is used to convert the shared memory addre…
-
The following scripts need `root` privs to run `smem`:
- `tests/metrics/density/docker_memory_usage.sh.in`
- `tests/metrics/smem_monitor.sh`
These scripts should contain a check like the follow…
-
The `smem_start` field in `fb_fix_screeninfo` has type `unsigned long`, so you should use `%lu` instead of `%d` in `printf` for `unsigned long`. Didn't you notice that the value of `smem_start` in your output is a negat…
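
A minimal sketch of the suggested fix, kept independent of the framebuffer device so it compiles anywhere. The helper names `format_smem_start` and `format_smem_start_hex` are hypothetical; the only point is that the conversion specifier matches the `unsigned long` argument.

```c
#include <stdio.h>
#include <stddef.h>

/* Hypothetical helper: format an unsigned long value such as smem_start
 * with the matching %lu conversion, so large addresses are not
 * misread as negative ints. Returns what snprintf returns. */
int format_smem_start(char *buf, size_t len, unsigned long smem_start)
{
    return snprintf(buf, len, "smem_start = %lu", smem_start);
}

/* Same value in hexadecimal (%lx), which is often easier to read
 * for physical addresses. */
int format_smem_start_hex(char *buf, size_t len, unsigned long smem_start)
{
    return snprintf(buf, len, "smem_start = 0x%lx", smem_start);
}
```

Passing an `unsigned long` to `%d` is undefined behaviour in C, which is why the printed value can look negative or otherwise wrong.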
-
CHIRRTL memories were supposed to have an `undefined`, `old`, or `new` read-under-write modifier after the memory declaration. E.g.:
```
smem mem1 : UInt [8] old
smem mem2 : UInt [8] new
smem m…
-
Hi,
I am trying to map some WGS data with bwa-mem2 and get this error after ~25 min of mapping:
bwa-mem2: src/bwamem.cpp:949: int mem_kernel1_core(const mem_opt_t *, const bntseq_t *, const unsigned …
-
```
Currently CSoar does not support multiple agents in the same database nor does
it support epmem and smem in the same database. This is a feature request for
both of these. Theoretically smem a…