tugrul512bit / Cekirdekler

Multi-device OpenCL kernel load balancer and pipeliner API for C#. Uses a shared-distributed memory model to keep GPUs updated quickly while running the same kernel on all devices (for simplicity).
GNU General Public License v3.0

Device to device pipeline: optimize single-stage, multiple-kernel compute with fewer synchronizations #35

Closed: tugrul512bit closed this issue 7 years ago

tugrul512bit commented 7 years ago

Use the Cores class's "single sync multi kernel execution" feature if all kernels in the stage use the same global and local range values

or

add a "single sync multi kernel with multi range values" feature, so that kernels with different ranges can also share one synchronization.
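To make the two options concrete, here is a minimal sketch of the idea. It is a toy model, not Cekirdekler code: `KernelLaunch`, `FakeQueue`, and the `run_stage_*` functions are hypothetical names invented for illustration, and the queue is assumed to behave like an in-order OpenCL command queue, where kernels run in enqueue order and a single blocking finish at the end is enough. Python is used only as a neutral modelling language.

```python
from dataclasses import dataclass, field


@dataclass
class KernelLaunch:
    """One kernel of a pipeline stage with its own range values (hypothetical type)."""
    name: str
    global_range: int
    local_range: int


@dataclass
class FakeQueue:
    """Stand-in for an in-order device command queue; only counts work and syncs."""
    enqueued: list = field(default_factory=list)
    sync_count: int = 0

    def enqueue(self, launch: KernelLaunch) -> None:
        # OpenCL 1.x requires the global size to be a multiple of the local size.
        if launch.global_range % launch.local_range != 0:
            raise ValueError(f"{launch.name}: global range not divisible by local range")
        self.enqueued.append(launch.name)

    def finish(self) -> None:
        # Models a blocking synchronization point (similar in spirit to clFinish).
        self.sync_count += 1


def same_ranges(launches) -> bool:
    """True if every kernel shares one (global, local) pair, i.e. the first option applies."""
    return len({(k.global_range, k.local_range) for k in launches}) <= 1


def run_stage_sync_per_kernel(queue: FakeQueue, launches) -> None:
    """Baseline: synchronize after every kernel of the stage."""
    for launch in launches:
        queue.enqueue(launch)
        queue.finish()


def run_stage_single_sync(queue: FakeQueue, launches) -> None:
    """Proposed: enqueue every kernel with its own ranges, then synchronize once."""
    for launch in launches:
        queue.enqueue(launch)
    queue.finish()


if __name__ == "__main__":
    stage = [
        KernelLaunch("stage_a", 1024, 64),
        KernelLaunch("stage_b", 512, 128),
        KernelLaunch("stage_c", 2048, 256),
    ]
    baseline, proposed = FakeQueue(), FakeQueue()
    run_stage_sync_per_kernel(baseline, stage)
    run_stage_single_sync(proposed, stage)
    print(same_ranges(stage))   # False: the kernels need per-kernel range values
    print(baseline.sync_count)  # 3
    print(proposed.sync_count)  # 1
```

In this model, a stage with N kernels drops from N synchronizations to one. The `same_ranges` check marks where the first option stops being enough and per-kernel range values are needed.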