-
RFE: extend `cuda::std::numeric_limits` to the CUDA `__half` type. This should be legal according to cppreference as an extension of numeric_limits to an implementation-defined type.
-
It seems that when the MG setup is refreshed for a particular monomial, the subsequent solve for another monomial (with `tm_rho=0`) fails to converge.
I wonder if running the refresh
https://git…
-
https://github.com/lattice/quda/blame/d571552ff32465c7a50aaf39ce0b8c7497e000e8/include/kernels/dslash_gamma_helper.cuh#L80
-
We can choose three different conventions while using `staggeredPhaseQuda` to apply the staggered phase to a gauge field. `QUDA_STAGGERED_PHASE_CPS` should apply the CPS software's staggered phase, wh…
-
A STRICT build using sm_86 with MULTIGRID on fails with:
Building CUDA object lib/CMakeFiles/quda.dir/dslash_mdw_fused_ls20.cu.o
ptxas error : Value of threads per SM for entry _ZN4quda10raw_kerne…
-
Right now, we do not really care about the block-size. But especially when it comes to multiple RHS it will matter. We should implement an autotuner like quda has. One has to think of a good design. T…
-
Reproducer:
```
> ./staggered_dslash_test --test Mat
[...]
[ RUN ] StaggeredDslashTest.verify
Sending fat links to GPU
Sending long links to GPU
running the following test:
prec recon …
-
Hi all,
I found that I just need use the performWuppertalnStep in the interface_quda.cpp from chroma to do the gaussian smearing I need. So the things I need to do are just passing the correct para…
-
Hi All,
I have several reports about folks coming accross this error:
```
Initializing QUDA device (using CUDA device no. 0)
ERROR: Current communicator can't be found. (rank 2, host frontier0…
-
From Carleton:
```
Hi Balint,
Yesterday afternoon I built QUDA/develop on Frontier (apparently)
successfully. Steve reported compilation errors when he did that
later. So I tried a new pull…