DPLASMA is a highly optimized, accelerator-aware, implementation of a dense linear algebra package for distributed heterogeneous systems. It is designed to deliver sustained performance for distributed systems where each node featuring multiple sockets of multicore processors, and if available, accelerators, using the PaRSEC runtime as a backend.
Other
10
stars
8
forks
source link
zpotrf_dtd fixed for the GPU, updated kernels to cublas_v2, although results still suspicious #94
testing_zpotrf_dtd now works with GPU. However, the factorisation and results produced are suspicious