issues
search
cp2k
/
dbcsr
DBCSR: Distributed Block Compressed Sparse Row matrix library
https://cp2k.github.io/dbcsr/
GNU General Public License v2.0
135
stars
47
forks
source link
ocl: improved determinism for small SMM sequences/batches
#548
Closed
hfp
closed
2 years ago
hfp
commented
2 years ago
Implemented cl_cache_dir (see
https://github.com/intel/compute-runtime/blob/master/opencl/doc/FAQ.md#feature-cl_cache
).
Introduced opencl_libsmm_timer_t (opencl_libsmm_timer_device, opencl_libsmm_timer_device).
Introduced environment variable OPENCL_LIBSMM_TIMER (device|host or 0|1).
Made some code optional (ACC_OPENCL_CPPBIN, ACC_OPENCL_SEDBIN).
Enable ACC_OPENCL_CACHEDIR in case of __DBCSR_ACC.
Fixed regex (tune_multiply.sh).
Adjusted some default values.
Updated LIBXSMM for Daint-CI.
Code cleanup.