This finally adds nestable barriers under the name syncScope that await a whole scope and all tasks and descendants created within.
It solves #31 and allows working around #97.
Concretely, this allows porting any OpenMP-based algorithm in a straightforward way and allow nested parallelism (unlike OpenMP).
The gemm example had a nestable version. It can be called from a parallel batched matrix multiplication or batched convolutions.
This finally adds nestable barriers under the name
syncScope
that await a whole scope and all tasks and descendants created within.It solves #31 and allows working around #97.
Concretely, this allows porting any OpenMP-based algorithm in a straightforward way and allow nested parallelism (unlike OpenMP). The gemm example had a
nestable
version. It can be called from a parallel batched matrix multiplication or batched convolutions.