-
"-g" or "-O3" is only for the NVCCFLAGS. The options in CFLAGS/CXXFLAGS does not affect the compute results.
Specifically the problem comes from the staple computing function and can be reproduced e…
gshi updated
11 years ago
-
This is an irksome interface issue, which I'm sure will be dealt with once the SC rush is over. But I spent an hour trying to understand why my code was giving an error, so I feel like raising the is…
-
There are a few things at the end of quda.h that should be moved into quda_internal.h or somewhere else: CUERR, PRINTF, and "extern int verbose".
"verbose" itself should probably either be wrapped in…
-
/usr/local/cuda/bin/nvcc -O3 -DPOINTER_SIZE=8 -D__COMPUTE_CAPABILITY__=200 -ftz=true -prec-div=false -prec-sqrt=false -DMULTI_GPU -DOVERLAP_COMMS -DGPU_STAGGERED_DIRAC -DGPU_GAUGE_FORCE -DGPU_DIRECT …
gshi updated
12 years ago
-
Assigning Mike...
-
QUDA should have the feature to conveniently bind to numa-optimized cpu/gpu. Here is my thought so far
*) we can add a separate c program to generate a numa mapping file. I already have such a C prog…
gshi updated
12 years ago
-
Hi, a user was trying to run QUDA and came accross this error:
(CUDA) too many resources requested for launch (node 0, blas_quda.cu:929)
He was trying to run a 16^4 clover lattice on a single C2050 …
bjoo updated
12 years ago
-
I get a nice segfault when executing chroma with built-in quda support:
Initialize done
Initializing QUDA device: 0
QUDA: Found device 0: Tesla C2070
QUDA: Found device 1: Tesla C2070
QUDA: Found dev…
-
During QUDA clover BiCGStab inversion the following happens:
BiCGstab: 250 iterations, r2 = 9.738363e-08
BiCGstab: 251 iterations, r2 = 1.270892e-07
QUDA error: (CUDA) too many resources requested fo…
-
Hi,
running on 2 hosts with 1 mpi node per host does not work for me. Whereas running on 1 host with 2 mpi nodes works fine.
I tracked it down to
face_qmp.cpp: void FaceBuffer::exchangeCpuLink(void…