-
This will require reworking gauge_quda.cpp into a similar form as the ColorSpinorField, with derived classes for cpu or cuda. This must allow:
1. Arbitrary number of colours (though Nc=3 will continu…
-
The current interface requires that the host-side spinor fields are in chiral basis, and the conversion to non-relativistic spinor fields is done when the field copied to the device. Additionally, th…
-
Using multiple GPUs in anything other than the T dimension seems to get the wrong answer, and the wilson_dslash_test fails, e.g.,
```
$mpirun -np 2 ./wilson_dslash_test --xdim 16 --ydim 16 --zdim 8 -…
-
When building staggered in single GPU mode (create make.inc with
./configure --enable-os=linux --enable-gpu-arch=sm_20 --enable-staggered-dirac --disable-wilson-dirac --disable-domain-wall-dirac --d…
bjoo updated
13 years ago
-
When building staggered with --with-qmp= ... flag in multi-gpu mode
I get unresolved symbols (see at end of message)
Issue is not present when compiling pure MPI (--with-mpi=... , but no --with-qmp)
…
bjoo updated
13 years ago
-
multiple calls to loadGaugeQuda produce
in the minvcg branch, when not using mixed precision, in multi-GPU mode, multiple calls to loadGaugeQuda can elicit the error:
QUDA error: (CUDA) invalid arg…
bjoo updated
13 years ago
-
Hi Folks,
I've been trying to build QUDA with staggered disabled. Indeed looking at the compile line:
/usr/local/cuda/bin/nvcc -O3 -D__CUDA_ARCH__=200 -ftz=true -prec-div=false -prec-sqrt=false -D…
bjoo updated
13 years ago
-
Similar to what Guochun added for the staggered kernels. This should have the option to perform non-texture reads (i.e., through the L1 on Fermi) on the gauge fields and/or the spinor fields.
-
I just pulled the master and tried to run make tune. The default tests/blas_test.cu has
// volume per GPU
const int LX = 12; // Has to be checkerboarded value... (so 24->12)
const int LY = 24;
const…
bjoo updated
13 years ago
-
Multiple calls to loadGaugeQuda() cause the error message:
QUDA error: Error: even/odd field is not null, probably already allocated(even=0x54a0000, odd=0x54a7800)
(this is pernicious when wrapping …
bjoo updated
13 years ago