lattice / quda

QUDA is a library for performing calculations in lattice QCD on GPUs.
https://lattice.github.io/quda
Other
295 stars 100 forks source link

Kernel fusion for 4-d preconditioned domain-wall and Mobius kernels #183

Open maddyscientist opened 9 years ago

maddyscientist commented 9 years ago

The 4-d preconditioned chiral fermion dslashes presently use multiple kernels to apply the preconditioned dslash. This is an issue to explore fusing these kernels, the motivation is two fold:

  1. This will improve single node performance since it will reduce memory traffic.
  2. If the 4-d dslash part is fused with other operations it will make the strong scaling better since there will be more compute to overlap with.
weinbe2 commented 3 months ago

Mobius is covered by https://github.com/lattice/quda/pull/1163 , I guess 4-d preconditioned Shamir is still outstanding