Multi-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).
GNU General Public License v3.0
93
stars
10
forks
source link
Device to device pipeline: optimize single stage multiple kernel compute with less synchronizations #35
use Cores class' "single sync multi kernel execution" feature if all stage kernels use same global and local range values
or
add "single sync multi kernel with multi range values" feature