cp2k / dbcsr

DBCSR: Distributed Block Compressed Sparse Row matrix library
https://cp2k.github.io/dbcsr/
GNU General Public License v2.0

ocl: updated tuned parameters, improved tuner, adjusted kernel #731

Closed by hfp 8 months ago

hfp commented 8 months ago

tune_multiply.py

Tuning on multiple devices (tune_multiply.py)

kernels/multiply.cl

Documentation