PaRSEC is a generic framework for architecture-aware scheduling and management of micro-tasks on distributed, GPU-accelerated, many-core heterogeneous architectures. PaRSEC assigns computation threads to cores and GPU accelerators, overlaps communication with computation, and uses a dynamic, fully distributed scheduler driven by architectural features such as NUMA nodes and algorithmic features such as data reuse.
Users should be given a mechanism to interrupt the (distributed) execution of a handle.
This could be done in a way similar to what Shawn did for fault tolerance: trigger a new DAG that takes the handle to interrupt as an argument and broadcasts the information that the local number of tasks for that handle should be set to 0.
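To make the proposal concrete, here is a minimal single-process sketch of the idea, not PaRSEC code. Every name in it (`sketch_handle_t`, `sketch_interrupt_local`, `sketch_interrupt_all`, `sketch_run_one`) is hypothetical, and the "broadcast" is modeled as a plain loop over an array of per-rank handle views. It assumes that a worker stops pulling tasks for a handle once its local task count reaches 0.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>

/* Hypothetical per-rank view of a handle; NOT a PaRSEC type. */
typedef struct {
    atomic_int nb_local_tasks; /* tasks this rank still has to run */
    int executed;              /* tasks actually run (illustration only) */
} sketch_handle_t;

static void sketch_handle_init(sketch_handle_t *h, int nb_tasks) {
    assert(h != NULL && nb_tasks >= 0);
    atomic_init(&h->nb_local_tasks, nb_tasks);
    h->executed = 0;
}

/* Body of the interrupt task on one rank: force the local count to 0.
 * Returns how many pending tasks were dropped on this rank. */
static int sketch_interrupt_local(sketch_handle_t *h) {
    assert(h != NULL);
    return atomic_exchange(&h->nb_local_tasks, 0);
}

/* Stand-in for the broadcast performed by the interrupt DAG: apply the
 * local interrupt on every rank. Returns the total number dropped. */
static int sketch_interrupt_all(sketch_handle_t *ranks, size_t nranks) {
    int dropped = 0;
    for (size_t r = 0; r < nranks; r++) {
        dropped += sketch_interrupt_local(&ranks[r]);
    }
    return dropped;
}

/* Worker step: claim and run one task if any remain.
 * Returns 1 if a task ran, 0 if the handle is finished or interrupted. */
static int sketch_run_one(sketch_handle_t *h) {
    assert(h != NULL);
    int n = atomic_load(&h->nb_local_tasks);
    while (n > 0) {
        /* On failure, n is reloaded with the current count and we retry. */
        if (atomic_compare_exchange_weak(&h->nb_local_tasks, &n, n - 1)) {
            h->executed++;
            return 1;
        }
    }
    return 0;
}
```

A caller would initialize one `sketch_handle_t` per rank, let workers call `sketch_run_one` in a loop, and call `sketch_interrupt_all` to stop them. The sketch leaves out what a real implementation would have to decide, such as how to treat tasks already running when the interrupt arrives and how the interrupt DAG itself terminates.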
Original report by Thomas Herault (Bitbucket: herault, GitHub: therault).