DPLASMA is a highly optimized, accelerator-aware implementation of a dense linear algebra package for distributed heterogeneous systems. It is designed to deliver sustained performance on distributed systems where each node features multiple sockets of multicore processors and, if available, accelerators, using the PaRSEC runtime as a backend.
Describe the bug
PaRSEC profiles READ tasks in GEMM even though profiling is set to 'off' in the JDF for this task class.
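For readers unfamiliar with the setting, here is a minimal sketch of how a task class is usually marked as not profiled in a JDF file. It is an illustration only, not the actual GEMM source: the descriptor name `descC`, the index ranges, the flow type, and the successor task are assumptions.

```
/* Hypothetical excerpt, not copied from the DPLASMA GEMM JDF. */
READ_C(m, n) [profile = off]   /* task class property that should disable profiling */

/* Execution space (assumed tile ranges of C) */
m = 0 .. descC->mt-1
n = 0 .. descC->nt-1

/* Task affinity: run where tile C(m, n) lives */
: descC(m, n)

/* Assumed dataflow: read the tile from memory and forward it to GEMM */
READ C <- descC(m, n)
       -> C GEMM(m, n, 0)

BODY
{
    /* No computation: the task only injects the tile into the dataflow. */
}
END
```

With a property like this in place, no events for the READ task class would be expected in the generated profile.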
To Reproduce
Steps to reproduce the behavior:
Expected behavior
Program runs to completion without profiling READ tasks.
Environment (please complete the following information):
Additional context
READ_C and WRITE_C tasks were introduced to adapt the code for inter-node task migration. Changes were also made to the data flow types.