autumnai / collenchyma

Extendable HPC-Framework for CUDA, OpenCL and common CPU
http://autumnai.github.io/collenchyma
Apache License 2.0
475 stars 33 forks source link

Refactor synchronization #62

Open alexandermorozov opened 8 years ago

alexandermorozov commented 8 years ago

I've implemented memory access API and syncronization based on bitmasks. Tesnsor/TensorView and decoupling aren't implemented.

Native and CUDA pass all tests. OpenCL compiles but segfaults on my machine, both with this PR and without it.

PR isn't ready to be merged yet -- I'd like to fix plugins and Leaf first to see that there are no unexpected problems.